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NOVEL CARD PROTEINS INVOLVED IN 
CELL DEATH REGULATION 

BACKGROUND OF THE INVENTION 

FIELD OF THE INVENTION 

This invention relates generally to the fields of 
molecular biology and molecular medicine and more 
specifically to the identification of proteins involved 
in programmed cell death and associations of these 
proteins . 

BACKGROUND INFORMATION 

Programmed cell death is a physiologic process that 
ensures homeostasis is maintained between cell production 
and cell turnover in essentially all self-renewing 
tissues. In many cases, characteristic morphological 
changes, termed "apoptosis, " occur in a dying cell. 
Since similar changes occur in different types of dying 
cells, cell death appears to proceed through a common 
pathway in different cell types. 

In addition to maintaining tissue homeostasis, 
apoptosis also occurs in response to a variety of 
external stimuli, including growth factor deprivation, 
alterations in calcium levels, free-radicals, cytotoxic 
lymphokines, infection by some viruses, radiation and 
most chemotherapeutic agents. Thus, apoptosis is an 
inducible event that likely is subject to similar 
mechanisms of regulation as occur, for example, in a 
metabolic pathway. In this regard, dysregulation of 
apoptosis also can occur and is observed, for example, in 
some types of cancer cells, which survive for a longer 



time than corresponding normal cells, and in 
neurodegenerative diseases where neurons die prematurely. 
In viral infections, induction of apoptosis can figure 
prominently in the pathophysiology of the disease 
process, because immune-based eradication of viral 
infections depends on elimination of virus-producing host 
cells by immune cell attack resulting in apoptosis. 

Some of the proteins involved in programmed cell 
death have been identified and associations among some of 
these proteins have been described. However, additional 
apoptosis regulating proteins remain to be found and the 
mechanisms by which these proteins mediate their activity 
remains to be elucidated. The identification of the 
proteins involved in cell death and an understanding of 
the associations between these proteins can provide a 
means for manipulating the process of apoptosis in a cell 
and, therefore, selectively regulating the relative 
lifespan of a cell or its relative resistance to cell 
death stimuli. 

The principal effectors of apoptosis are a family of 
intracellular proteases known as Caspases, representing 
an abbreviation for Cysteine Aspartyl Prote ases . 
Caspases are found as inactive zymogens in essentially 
all animal cells. During apoptosis, the caspases are 
activated by proteolytic processing at specific aspartic 
acid residues, resulting in the production of subunits 
that assemble into an active protease typically 
consisting of a heterotetramer containing two large and 
two small subunits (Thornberry and Lazebnik, Science 
281:1312-1316 (1998)). The phenomenon of apoptosis is 
produced directly or indirectly by the activation of 
caspases in cells, resulting in the proteolytic cleavage 
of specific substrate proteins. Moreover, in many cases, 



caspases can cleave and activate themselves and each 
other, creating cascades of protease activation and 
mechanisms for M auto "-acti vat ion . 

5 i\mong the substrates of caspases are the 

intracellular proforms of cytokines such as pro- 

Interleukin-lp (pro-IL-l)3) and pro-IL-18. When cleaved 
by caspases, these pro-proteins are converted to the 
biologically active cytokines which are then liberated 

10 from cells, circulating in the body and eliciting 

inflammatory immune reactions. Thus, caspases can be 
involved, in some instances, in cytokine activation and 
responses to infectious agents, as well as inflammatory 
and autoimmune diseases. Caspases also participate in 

15 signal transduction pathways activated by some cytokine 
receptors, particularly members of the Tumor Necrosis 
Factor (TNF) family of cytokine receptors which are 
capable of activating certain caspase zymogens. 

2 0 Thus, knowledge about the proteins having domains 

that interact with and regulate caspases is important for 
devising strategies for manipulating cell life and death 
in therapeutically useful ways. The identification of 
such proteins that contain caspase-interacting domains 

25 and the elucidation of the proteins with which they 

interact, therefore, can form the basis for strategies 
designed to modulate apoptosis, cytokine production, 
cytokine receptor signaling, and other cellular 
processes. Thus a need exists to identify proteins that 

30 interact with caspases and other apoptosis related 

proteins* The present invention satisfies this need and 
provides additional advantages as well. 
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SUMMARY OF THE INVENTION 

In accordance with the present invention, there are 
5 provided novel ^NB-ARC and CARD"-containing proteins, 
designated NAC, as well as several isoforms of NAC 
produced by alternative mRNA splicing. The invention 
also provides nucleic acid molecules encoding NAC and its 
isoforms, vectors containing these nucleic acid molecules 
10 and host cells containing the vectors. The invention 
also provides antibodies that can specifically bind to 
NAC proteins, including alternative isoforms thereof. 

The present invention also provides a screening 
15 assay useful for identifying agents that can effectively 
alter the association of NAC with itself or with other 
proteins. By altering the self-association of NAC or by 
altering their interactions with other proteins, an 
effective agent may increase or decrease the level of 
20 caspase proteolytic activity or apoptosis in a cell, or 

it may increase or decrease the levels of NF-kB, cytokine 
production, or other events. 

The invention also provides methods of altering the 
25 activity of NAC in a cell, wherein such increased or 
decreased activity of NAC can modulate the level of 
apoptosis or other cellular responses. For example, the 
activity of NAC in a cell can be increased by introducing 
into the cell and expressing a nucleic acid sequence 
30 encoding these proteins. In addition, the activity of 
NAC in a cell can be decreased by introducing into the 
cell and expressing a fragment of NAC, or an antisense 
nucleotide sequence that is complementary to a portion of 
a nucleic acid molecule encoding the NAC proteins. 
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The invention also provides methods for using an 
agent that can specifically bind NAC or a nucleotide 
sequence that can bind to a nucleic acid molecule 
encoding NAC to diagnose a pathology that is 
5 characterized by an altered level of apoptosis due to an 
increased or decreased level of NAC in a cell. 

BRIEF DESCRIPTION OF THE FIGURES 

10 Figure 1A shows the cloning strategy for NAC and Isoforms 
of NAC. The NB-ARC domain (filled box), leucine-rich 
repeats (LRR, filled bars) , and the CARD domain (labeled 
box) are depicted. Relevant restriction sites (RI for 
EcoRI, X for Xho I) are indicated* Positions for forward 

15 PCR primers (IF, 2F, and 3F) and reverse primers (1R, 2R, 
and 3R) which were used for Reverse Transcriptase- 
Polymerase Chain Reaction cloning of NAC and NAC-i so forms 
are shown. 

20 Figure IB shows multiple isoforms of NAC. Isoforms of 
NAC are generated by alternative mRNA splicing, based on 
cDNA cloning results. The same symbols as in Figure 1A 
are used. Two alternatively spliced exons are shown as 
dotted boxes and hatched boxes, respectively. Note that 

25 longer and shorter versions of the CARD domain are 

produced (CARD L and CARDJ . The four resultant isoforms 
are described as NAC a, NAC(3, NACy and NAC5. 

Figure 1C shows the cDNA and amino acid sequence of the 
30 longest NAC isoform (also set for in SEQ ID NOs : 1 and 2). 
The nucleotide sequences of the two alternatively spliced 
exons (nucleotides 2870-2959 , and 3784-3915, 
respectively, and amino acids 918-947 and 1262-1305) are 
underlined. The positions for the P-loop (Walker A) and 
35 Walker B of NB-ARC domain are indicated. The LRR repeats 



are in bold letters (amino acids 808-948), and the CARD 
domain is in bold underlined letters (amino acids 
1373-1473) . 

Figure ID shows a sequence analysis of NAC: NB-ARC 
homology. Alignment of the NB-ARC domains of human NAC 
(amino acids 329-547), CARD 4 (amino acids 197-408), and 
Apaf-1 (amino acids 138-352), and Caenorhabditis elegans 
CED4 (amino acids 154-374) . Alignment was conducted 
using Clustal method (Thompson et al . , Nuc. Acids Res. 
22:4673-4680 (1994)). Identical and similar residues are 
shown in black and gray, respectively. 

Figure IE shows alignment of CARD domain of NAC and other 
CARD-containing proteins. Alignment was conducted using 
Clustal method. Identical and similar residues are shown 
in black and gray, respectively. 

Figure 2 shows self-association of Long and Short CARD 
domains of NAC. (A) For in vitro binding assays, 
purified GST fusion proteins immobilized on GSH-sepharose 
containing CARD L (lane 3), CARD S (lane 4), or GST alone 
(lane 2) were incubated with 35 S-labeled, in vitro 
translated CARD L (top panel), CARD S (middle panel), or 
control protein Skp-1 (bottom panel) . In vitro 
translation mix (one tenth of input, lane 1) was directly 
loaded as control. (B) Homophilic interactions of CARD. 
In vitro translated Apaf-1 ( -WD) (top panel), CED4 
(middle panel), or control Skp-1 (bottom panel) proteins 
were incubated with GST (lane 2), GST-CARD L (lane 3), and 
GST-CARD S (lane 4) immobilized on GSH-sepharose beads. In 
lane 1, one tenth of input 35 S proteins are shown. 

Figure 3 shows homophilic interactions of CARD domains 
detected by yeast two-hybrid method. Yeast cells were 
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co-transformed with plasmids encoding the indicated 
proteins fused to LexA DNA binding domain (LexA) and 
proteins fused to B42 transactivation domain (B42) . 
Transformants were replica-plated on leucine-supplemented 
5 plates (Leu+) and leucine-def icient plates (Leu-) to 
assess protein interactions. p-galactosidase activity 
(LacZ) was measured for each transf ormant, and were 
scaled as: absent (-) , weak (+/-), detectable (+) , strong 
(++), very strong (+++), and strongest (++++). 

0 

Figure 4 shows self-association of NB-ARC domain of NAC • 
In vitro translated, 35 S-labeled rat reticulocyte lysates 
(1 ill) containing NB-ARC (lanes 2 and 3) or Skp-1 (as a 
control; lanes 5 and 6) were incubated with purified 
5 GST-NB-ARC (lanes 3 and 6) or GST alone (lanes 2 and 5) 
immobilized on GSH-sepharose beads for in vitro binding 
assays. In lanes 1 and 4, one tenth of input 35 S proteins 
are shown. 

Figure 5 shows that NAC forms complexes with Apaf-1 and 
CED4 . (A) Complex formation with human Apaf-1. 293T 
cells were transiently transfected with an expression 
plasmid encoding HA-tagged human Apaf-1 lacking the C- 
terminal WD repeats [HA-Apaf - 1 (A WD) ] in the presence 
(lanes 2 and 3) or absence (lane 1) of a plasmid encoding 
myc-tagged full-length NAC (myc-NAC) . Transfected cells 
were lysed and subjected to immunoprecipitation (IP) with 
either a mouse monoclonal antibody to myc (lanes 1 and 3) 
or a control mouse IgG (lane 2) . Proteins from the 
immune complexes were resolved by SDS-PAGE, transferred 
to nitrocellulose, and subjected to immunoblot analysis 
(WB) using anti-HA antibodies (bottom panel) followed by 
anti-myc antibodies (top panel) . One tenth of the total 
cell lysates derived from each transfection were loaded 
directly in the gel as a control (Lysate) . (B) Complex 



formation with C. elegans CED4 protein. Identical 
procedures and conditions described for Apaf-1 in (A) 
were employed for CED4 interaction studies with NAC . 

Figure 6 shows that NAC interacts with pro-Casp8, but not 
pro-Casp9. (A) Interaction with pro-Casp8. 293T cells 
were transfected with an expression plasmid encoding HA- 
tagged human pro-Casp8 [HA-Casp8 (C/A) ] , which harbors 
an alanine replacement of the catalytic cysteine residue, 
in the presence (lanes 2 and 3) or absence (lane 1) of 
myc-NAC expression plasmid. Transfected cells were lysed 
and subjected to immunoprecipitation (IP) with either 
anti-myc antibodies (lanes 1 and 3) or a control antibody 
(lane 2) . The immunoprecipitated proteins were resolved 
by SDS-PAGE, transferred to nitrocellulose, and analyzed 
by iimunoblotting (WB) for pro-Casp8 (bottom panel) using 
anti-HA antibodies or for NAC (top panel) using anti-myc 
antibodies. One tenth of the total cell lysates of each 
transfection was loaded directly in gels as a control 
(Lysate) . (B) Interaction with pro-Casp9. Identical 
procedures and conditions described for Casp8 were used 
for Casp9 interaction studies with NAC. The Casp9 
expression plasmid [Flag-Casp9 (C/A)] contains a C- 
terminal Flag-tagged form of pro-Casp9 harboring an 
alanine replacement of the catalytic cysteine residue. 
The immunoblots were probed for Casp9 (bottom panel) 
using a rabbit anti-Casp9 polyclonal antibody derived 
against GST-Casp9 fusion proteins. 



DETAILED DESCRIPTION OF THE INVENTION 



In accordance with the present invention, there are 
provided "substantially pure" mammalian CARD-containing 
proteins, designated NAC and CARD-X . As used herein, the 
term "NAC" refers to a protein that contains both an 



NB-ARC domain and a CARD domain (NAC) . The invention NAC 
proteins represent novel members of the "CARD domain" 
family of proteins, which family includes CED-4 and 
Apaf-1. An invention NAC comprises a NB-ARC domain and a 
CARD domain, and optionally further comprises a 
leucine-rich repeat domain and/or a TIM-Barrel-like 
domain . 

As used herein, the term "CARD domain" refers to a 
Caspase Recruitment Domain (Hofmann et al., Trends 
Biochem. Sci. 22:155-156 (1997)). CARD domains have been 
found in some members of the Caspase family of cell death 
proteases. Caspases-1, 2, 4, 5, 9, and 11 contain CARD 
domains near their NH 2 -termini. These CARD domains 
mediate interactions of the zymogen inactive forms of 
caspases with other proteins which can either activate or 
inhibit the activation of these enzymes. For example, 
the CARD domain of pro-caspase-9 binds to the CARD domain 
of a caspase-activating protein called Apaf-1 (Apoptosis 
Protease Activating Factor-1) . Similarly, the CARD 
domain of pro-caspase-1 permits interactions with another 
CARD protein known as Cardiac (also referred to as R1P2 
and RICK) , which results in activation of the caspase-1 
protease (Thome et al., Curr. Biol. 16:885-888 (1998)). 
And, pro-caspase-2 binds to the CARD protein Raidd (also 
know as Cradd) , which permits recruitment of 
pro-caspase-2 to Tumor Necrosis Factor (TNF) Receptor 
complexes and which results in activation of the 
caspase-2 protease (Ahmad et al . , Cancer Res. 57:615-619 
(1997)). CARD domains can also participate in homotypic 
interactions with themselves, resulting in 
self-association of proteins that contain these 
protein-interaction domains and producing dimeric or 
possibly even oligomeric complexes. 
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CARD domains can be found in association with other 
types of functional domains within a single polypeptide, 
thus providing a mechanism for bringing a functional 
domain into close proximity or contact with a target 
protein via CARD: CARD associations involving two 
CARD-containing proteins. For example, the 
Caenorhabiditis elegans cell death gene ced-4 encodes a 
protein that contains a CARD domain and a ATP-binding 
oligomerization domain called an NB-ARC domain (van der 
Biezen and Jones Curr Biol 8:R226-R227) . The CARD domain 
of the CED-4 protein interacts with the CARD domain of a 
pro-caspase called CED-3. The NB-ARC domain allows CED-4 
to self-associate, thereby forming an oligomeric complex 
which brings associated pro-CED-3 molecules into close 
proximity to each other. Because most pro-caspases 
possess at least a small amount of protease activity even 
in their unprocessed form, the assembly of a complex that 
brings the proforms of caspase into juxtaposition can 
result in trans-processing of zymogens, producing the 
proteolytically processed and active caspase. Thus, 
CED-4 employs a CARD domain for binding a pro-caspase and 
an NB-ARC domain for self-oligomeri zation, resulting in 
caspase clustering, proteolytic processing and 
activation . 

Numerous CED-4-related proteins have recently been 
identified. These proteins belong to the CED-4 family of 
proteins, and include CED-4 (Yuan and Horvitz, 
Development 116:309-320 (1992)), Apaf-1, (Zou et al., 
Cell 90:405-413 (1997)), Dark (Rodriguez et al . , Nature 
Cell Biol. 1:272-279 (1999)), and CARD4/Nodl (Bertin et 
al., J. Biol. Chem. 274:12955-12958 (1999) and Inohara et 
al., J. Biol. Chem. 274:14560-14567 (1999)). As used 
herein, a CED-4 family member is a protein that comprises 
a NB-ARC domain and a CARD domain. 
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The CED-4 homolog in humans and rodents, referred to 
as Apaf-1, has been found to function similarly. The 
Apaf-1 protein contains a (i) CARD domain, (ii) NB-ARC 
domain, and (iii) multiple copies of a WD-repeat domain. 
In contrast to CED-4 which can spontaneously oligomerize, 
the mammalian Apaf-1 protein is an inactive monomer until 
induced to oligomerize by binding of a co-factor protein, 
cytochrome c (Li et al . , Cell 91:479-489 (1997)). In 
Apaf-1, the WD repeat domains prevent oligomerization of 
the Apaf-1 protein, until coming into contact with 
cytochrome c. Thus, the WD-repeats function as a 
negative-regulatory domain that maintains Apaf-1 in a 
latent state until cytochrome c release from damaged 
mitochondria triggers the assembly of an oligomeric 
Apaf-1 complex (Saleh, J. Biol. Chem. 274:17941-17945 
(1999) ) . By binding pro-caspase-9 through its CARD 
domain, Apaf-1 oligomeric complexes are thought to bring 
the zymogen forms of caspase-9 into close proximity, 
permitting them to cleave each other and produce the 
proteolytic processed and active caspase-9 protease (Zou 
et al., J. Biol. Chem. 274:11549-11556 (1999)). 

In addition to their role in caspase-activation, 
CARD domains have been implicated in other cellular 
processes. Some CARD-containing proteins, for example, 
induce activation of the transcription factor NF-kB. 
NF-kB activation is induced by many cytokines and plays 
an important role in cytokine receptor signal 
transduction mechanisms (DiDonato et al . , Nature 388:548- 
554 (1997)). Moreover, CARD domains are found in some 
proteins that inhibit rather than activate caspases, such 
as the IAP (Inhibitor of Apoptosis Protein) family 
members, cIAPl and cIAP2 (Rothe et al . , Cell 83:1243-1252 
(1995) ) and oncogenic mutants of the Bcl-10 protein 



(Willis et al., Cell 96:35-45 (1999)). Also, though 
caspase activation resulting from CARD domain 
interactions is often involved in inducing apoptosis, 
other caspases are primarily involved in proteolytic 
processing and activation of inflammatory cytokines (such 
as pro-IL-10 and pro-IL-18) . Thus, CARD-containing 
proteins can also be involved in cytokine production, 
thus regulating immune and inflammatory responses. 

In view of the function of the CARD domain within 
invention NAC proteins, invention NAC proteins or CARD- 
domain containing fragments thereof, are contemplated 
herein for use in methods to modulate apoptosis, cytokine 
production, cytokine receptor signaling, and other 
cellular processes. Invention NAC proteins or CARD- 
domain containing fragments thereof are also contemplated 
in methods to identify CARD-binding agents that modulate 
apoptosis, cytokine production, cytokine receptor 
signaling, and other cellular processes. 

In one embodiment, a CARD domain of an invention NAC 
comprises a sequence with at least 50% identity to the 
CARD domain of NAC (see, e.g., residues 1373-1473 of SEQ 
ID NO: 2) . More preferably, a CARD domain of the 
invention comprises a sequence with at least 60% identity 
to the CARD domain of NAC. Most preferably, a CARD 
domain of the invention comprises a sequence with at 
least 75% identity to the CARD domain of NAC. Typically, 
a CARD domain of the invention comprises a sequence with 
at least 95% identity to the CARD domain of NAC. 

As described herein, invention NAC or CARD-X 
proteins can associate with other CARD-containing 
proteins. In particular, the association of the CARD 
domain of invention NAC proteins with another 



CARD-containing protein, such as Apaf-1, CED-4, caspases- 
1, 2, 9, 11, cIAPs-1 and 2, CARDIAK, Raidd, Dark, CARD 4 , 
and other NAC or CARD-X, and the like, is sufficiently 
specific such that the bound complex can form in vivo in 
a cell or in vitro under suitable conditions. Similarly 
therefore, an invention NAC protein can associate with 
another NAC protein by CARD: CARD association. 

A NAC protein of the invention further can associate 
with pro-caspases, caspases or with caspase-associated 
proteins, thereby modulating caspase proteolytic 
activity. Caspase proteolytic activity is associated 
with apoptosis of cells, and additionally with cytokine 
production. Therefore, an invention NAC can modulate 
apoptosis or cytokine production by modulating caspase 
proteolytic activity. As used herein a "caspase" is any 
member of the cysteine aspartyl proteases that associates 
with a NAC protein of the invention or with a NAC 
associated protein. Similarly, a "pro-caspase" is an 
inactive or less-active precursor form of a caspase, 
which is typically converted to the more active caspase 
form by a proteolytic event. 

CARD-containing proteins are also known to induce 
activation of the transcription factor NF-kB. Thus, an 
invention NAC can also modulate transcription by 
modulation of NF-kB activity. 

A NAC protein of the invention also comprises a 
NB-ARC domain. As described herein, a NB-ARC domain of 
the invention NAC protein comprises a sequence wherein 
the identity of residues in either the P-Loop (Walker A) 
or Walker B regions is at least 60% relative to the 
residues of NAC (see, e.g., residues 329-343 and 407-412 
of SEQ ID NO:2; see Figure 1C) . Preferably, an NB-ARC 
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domain of the invention NAC comprises a sequence wherein 
the overall identity of residues in the P-Loop (Walker A) 
and Walker B regions is at least 60% relative to the 
residues of NAC. More preferably, an NB-ARC domain of 
the invention comprises a sequence with at least 60% 
identity to the entire NB-ARC domain of NAC (see, e.g., 
residues 329-547 of SEQ ID NO : 2 ) . Most preferably, an 
NB-ARC domain of the invention comprises a sequence with 
at least 80% identity to the entire NB-ARC domain of NAC. 

The NB-ARC domain of the invention NAC proteins 
associates with other proteins, particularly with 
proteins comprising NB-ARC domains. Thus, a functional 
NB-ARC domain associates with NB-ARC domain-containing 
proteins by way of NB-ARC :NB-ARC association. As used 
herein, the term "associate" or "association" means that 
NAC can bind to a protein relatively specifically and, 
therefore, can form a bound complex. In particular, the 
association of the NB-ARC domain of NAC with another 
NB-ARC domain-containing proteins is sufficiently 
specific such that the bound complex can form in vivo in 
a cell or in vitro under suitable condition. Further, a 
NB-ARC domain demonstrates both nucleotide-binding (e.g., 
ATP-binding) and hydrolysis activities, which is 
typically required for its ability to associate with 
NB-ARC domain-containing proteins. Thus, an NB-ARC 
domain of the invention NAC comprises one or more 
nucleotide binding sites. As used herein, a nucleotide 
binding site is a portion of a protein that specifically 
binds a nucleotide such as, e.g., ATP, and the like. 
Typically, the nucleotide binding site of NB-ARC will 
comprise a P-loop, a kinase 2 motif, or a kinase 3a motif 
of the invention NAC (these motifs are defined, for 
example, in van der Biezen and Jones, supra) . 
Preferably, the nucleotide binding site of NB-ARC 



comprises a P-loop of the invention NAC . 

An invention NAC, therefore, is capable of CARD: CARD 
association and/or NB-ARC : NB-ARC association, resulting 
in a multifunctional protein capable of one or more 
specific associations with other proteins. An invention 
NAC can modulate cell processes such as apoptosis, 
cytokine production, and the like. For example, it is 
contemplated herein that an invention NAC protein can 
increase the level of apoptosis in a cell. It is also 
contemplated herein that an invention NAC can decrease 
the level of apoptosis in a cell. For example, a NAC 
which does not induce apoptosis may form hetero-oligomers 
with a NAC which is apoptotic, thus interfering with the 
apoptosis-inducing activity of NAC. 

In another embodiment of the invention the NAC 
protein of the invention also contains Leucine-Rich 
Repeats (LRR) domain, similar to a LRR described in 
another CARD protein known as CARD 4 (also known as Nodi) 
(Inohara et al., J. Biol, Chem. 274:14560-14567 (1999)). 
Unlike CARD-4 (Nodi), however, the CARD domain of NAC is 
located at the Carboxyl end of the protein whereas the 
CARD domain of CARD-4 (Nodi) is found at the NH 2 -end of 
the protein. The function of the LRR domain is to mediate 
specific interactions with other proteins. 

As used herein, leucine-rich repeat (LRR) domain of 
the invention NAC comprises a sequence with at least 50% 
identity to the LRR domain of NAC (see, e.g., residues 
808-948 of SEQ ID NO:2). Preferably, a LRR domain of the 
invention NAC comprises a sequence with at least 60% 
identity to the LRR domain of NAC. More preferably, a 
LRR region of the invention NAC comprises a sequence with 
at least 75% identity to the LRR domain of NAC. Most 
preferably, a LRR region of the invention NAC comprises a 
sequence with at least 95% identity to the LRR domain of 



NAC. 

It is further contemplated herein that a shortened 
LRR of the invention NAC may be used* A shortened LRR of 
the invention comprises a sequence with at least 90% 
identity to the splice variant form of the LRR (see, 
e.g., residues 808-917 of SEQ ID N0:2), and does not 
contain more than 90% of the residues in the splice 
region (see, e.g., residues 918-947 of SEQ ID NO:2). 
Preferably, the shortened LRR does not contain more than 
70% of the residues in the splice region. More 
preferably, the shortened LRR does not contain more than 
50% of the residues in the splice region. The shortened 
LRR will be of particular utility when the 
protein: protein interaction activity of a NAC comprising 
a shortened LRR differs from that observed for a NAC 
comprising the full-length LRR. Activity of a NAC with a 
shortened LRR will be determined by one or more of the 
assays disclosed herein, and shall be considered to 
differ from that of a NAC comprising the full-length LRR 
if any protein : protein interactions are altered by 10% or 
more, or if caspase activity or apoptotic activity is 
altered by 10% or more. 

In a further embodiment of the invention, invention 
NAC proteins contain a TIM-Barrel-like domain with 
similarity to TIM-barrel proteins. TIM-Barrel domains 
are well known in the art and typically consist of eight 
alternating a-helices and p-strands forming a barrel-like 
structure, but may contain 7 a-helices and/or P-strands 
in some instances. TIM-barrels have been found in some 



enzymes, such as aldolase, but also mediate protein 
interactions in some instances. 

As used herein, a TIM-Barrel-like domain of an 
invention NAC comprises a sequence with at least 50% 
identity to the TIM-Barrel-like domain of NAC (residues 
1079-1320 of SEQ ID NO : 2 ) . Preferably, a TIM-barrel-like 
domain of the invention NAC comprises a sequence with at 
least 60% identity to the TIM-Barrel-like domain of NAC. 
More preferably, a TIM-barrel domain of the invention NAC 
comprises a sequence with at least 70% identity to the 
TIM-barrel-like domain of NAC. Most preferably, a 
TIM-barrel-like domain of the invention NAC comprises a 
sequence with at least 80% identity to the TIM-barrel- 
like domain of NAC. 

Presently preferred NAC proteins of the invention 
include proteins that comprise substantially the same 
amino acid sequences as the protein sequence set forth in 
SEQ ID NOs:2, 4, and 6, as well as biologically active, 
functional fragments thereof. 

Those of skill in the art will recognize that 
numerous residues of the above-described sequences can be 
substituted with other, chemically, sterically and/or 
electronically similar residues without substantially 
altering the biological activity of the resulting NAC 
protein species. In addition, larger polypeptide 
sequences containing substantially the same sequence as 
amino acids set forth in SEQ ID NOs:2, 4, and 6, therein 
are contemplated. 

As employed herein, the term "substantially the same 
amino acid sequence" refers to amino acid sequences 
having at least about 70% identity with respect to the 
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reference amino acid sequence, and retaining comparable 
functional and biological activity characteristic of the 
protein defined by the reference amino acid sequence* 
Preferably, proteins having "substantially the same amino 
acid sequence" will have at least about 80%, more 
preferably 90% amino acid identity with respect to the 
reference amino acid sequence; with greater than about 
95% amino acid sequence identity being especially 
preferred. It is recognized, however, that polypeptides 
(or nucleic acids referred to hereinbefore) containing 
less than the described levels of sequence identity 
arising as splice variants or that are modified by 
conservative amino acid substitutions, or by substitution 
of degenerate codons are also encompassed within the 
scope of the present invention. 

The term "biologically active" or "functional", when 
used herein as a modifier of invention NACs, or 
polypeptide fragments thereof, refers to a polypeptide 
that exhibits functional characteristics similar to a 
NAC. Biological activities of NAC are, for example, the 
ability to bind, preferably in vivo, to a CARD-containing 
protein or a NB-ARC-containing protein, or to 
homo-oligomerize, or to modulate protease activation, 
particularly caspase activation, or to modulate NF-kB 
activity, or to modulate apoptosis, as described herein. 
Such NAC binding activity can be assayed, for example, 
using the methods described herein. Another biological 
activity of NAC is the ability to act as an immunogen for 
the production of polyclonal and monoclonal antibodies 
that bind specifically to an invention NAC. Thus, an 
invention nucleic acid encoding NAC will encode a 
polypeptide specifically recognized by an antibody that 
also specifically recognizes a NAC protein (preferably 
human) including the amino acid set forth in SEQ ID 



N0s:2, 4, 6, 10 or 12. Such immunologic activity may be 
assayed by any method known to those of skill in the art. 
For example, a test-polypeptide encoded by a NAC cDNA can 
be used to produce antibodies, which are then assayed for 
their ability to bind to an invention NAC protein 
including the sequence set forth in SEQ ID NOs:2, 4, 6, 
10 or 12. If the antibody binds to the test-polypeptide 
and the protein including the sequence encoded by SEQ ID 
NOs:2, 4, 6, 10 or 12 with substantially the same 
affinity, then the polypeptide possesses the requisite 
immunologic biological activity. 

As used herein, the term "substantially purified" 
means a protein that is in a form that is relatively free 
from contaminating lipids, proteins, nucleic acids or 
other cellular material normally associated with a 
protein in a cell. A substantially purified NAC can be 
obtained by a variety of methods well-known in the art, 
e.g., recombinant expression systems described herein, 
precipitation, gel filtration, ion-exchange, 
reverse-phase and affinity chromatography, and the like. 
Other well-known methods are described in Deutscher et 
al . , Guide to Protein Purification: Methods in 
Enzymol ocrv Vol . 182, (Academic Press, (1990)), which is 
incorporated herein by reference. Alternatively, the 
isolated polypeptides of the present invention can be 
obtained using well-known recombinant methods as 
described, for example, in Sambrook et al . , supra . , 
(1989) . 

In addition to the ability of invention NAC 
proteins, or fragements thereof, to interact with other, 
heterologous proteins (i.e., NB-ARC and CARD-containing 
proteins), invention NAC and CARD-X proteins have the 
ability to self-associate. This self -association is 



20 

possible through interactions between CARD domains, and 
also through interactions between NB-ARC domains. 
Further, self-association can take place as a result of 
interactions between LRR and TIM-Barrel-like domains. 

In accordance with the invention, there are also 
provided mutations and fragments of NAC which have 
activity different than a wild type NAC activity. As 
used herein, a "mutation" can be any deletion, insertion, 
or change of one or more amino acids in the wild type 
protein sequence, and a "fragment" is any truncated form, 
either carboxy- terminal , amino-terminal, or both, of the 
wild type protein. Preferably, the different activity of 
the mutation or fragment is a result of the mutant 
protein or fragment maintaining some but not all of the 
activities of wild type NAC. For example, a fragment of 
NAC can contain a CARD domain and LRR and TIM-Barrel-like 
domains, but lack a functional NB-ARC domain. Such a 
fragment will maintain a portion of the wild type NAC 
activity (e.g., CARD domain functionality), but not all 
wild type activities (e.g., lacking an active NB-ARC 
domain) . The resultant fragment will therefore have 
activity different than wild type NAC activity. In one 
embodiment, the activity of the fragment will be 
"dominant negative." A dominant negative activity will 
allow the fragment to reduce or inactivate the activity 
of one or more isoforms of wild type NAC. 

Isoforms of the NAC proteins are also provided which 
arise from alternative mRNA splicing and may alter or 
modify the interactions of the NAC protein with other 
proteins. For example, three novel isoforms of NAC are 
provided herein and designated: NACP, NACy and NAC5 (set 
forth as SEQ ID Nos:l, 3 and 5, respectively). The amino 
acid sequence and the portion of cDNA encoding the amino 



acid sequence of NACp is shown in Figure 1C, and the NACP 
cDNA and amino acid sequences are listed as SEQ ID NOs : 1 
and 2, respectively. NAC(3 represents the NAC splice 
variant in which both splice regions are present in the 
translated polypeptide, thereby including the nucleic 
acids 1-4422 of the NAC cDNA sequence and amino acids 
1-1473 of the NAC protein sequence of Figure 1C. NACy 
represents the NAC splice variant in which neither splice 
region is present in the translated polypeptide, thereby 
including the nucleic acids 1-2869, 2960-3783, and 
3916-4422 of the NAC cDNA sequence and amino acids 1-917, 
948-1261, and 1306-1473 of the NAC protein sequence of 
Figure 1C. The NACy cDNA and amino acid sequences are 
listed as SEQ ID NOs : 3 and 4, respectively. NAC5 
represents the NAC splice variant in which only the more 
carboxy-terminal splice region is present in the 
translated polypeptide, thereby including the nucleic 
acids 1-2869, and 2960-4422 of the NAC cDNA sequence and 
amino acids 1-917, and 948-1473 of the NAC protein 
sequence of Figure 1C. The NACS cDNA and amino acid 
sequences are listed as SEQ ID NOs : 5 and 6, respectively. 

In another embodiment of the invention, chimeric 
proteins are provided comprising NAC, or a functional 
fragment thereof, fused with another protein or 
functional fragment thereof. Functional fragments of NAC 
include, for example, NB-ARC, CARD, LRR and TIM-Barrel- 
like domains, as defined herein. Proteins with which the 
NAC or functional fragment thereof are fused will 
include, for example, glutathione-S-transf erase, an 
antibody, or other proteins or functional fragments 
thereof which facilitate recovery of the chimera. 
Further proteins with which the NAC or functional 
fragment thereof are fused will include, for example, 



luciferase, green fluorescent protein, an antibody, or 
other proteins or functional fragments thereof which 
facilitate identification of the chimera. Still further 
proteins with which the NAC or functional fragment 
thereof are fused will include, for example, the LexA DNA 
binding domain, ricin, a-sarcin, an antibody, or other 
proteins which have therapeutic properties or other 
biological activity. 

Further invention chimeric proteins contemplated 
herein are chimeric proteins wherein a domain of the NAC 
is replaced by a similar such domain from a heterologous 
protein. For example, the NB-ARC domain of NAC, as 
described above, can be replaced by the NB-ARC domain of 
Apaf-1, and the like. Another example of such a chimera 
is a protein wherein the CARD domain of NAC is replaced 
by the CARD domain from CED-4, and the like. 

The CARD-X protein contains a CARD domain and a 
region with similarity to TIM-Barrel-like domains, but 
otherwise is distinct from NAC, The cDNA sequence 
encoding CARD-X (SEQ ID NO : 7 ) reveals that it arises from 
a separate gene from NAC. The predicted CARD-X amino 
acid sequence (SEQ ID NO: 8), in particular, does not 
contain an NB-ARC domain. 

A CARD domain of the CARD-X protein comprises a 
sequence with at least 50% identity to the CARD domain of 
CARD-X (residues 343-431 of SEQ ID NO: 8). More 
preferably, a CARD domain of the invention comprises a 
sequence with at least 60% identity to the CARD domain of 
CARD-X. Most preferably, a CARD domain of the invention 
comprises a sequence with at least 75% identity to the 
CARD domain of CARD-X. Typically, a CARD domain of the 



invention comprises a sequence with at least 95% identity 
to the CARD domain of CARD-X . 

A TIM-Barrel-like domain of CARD-X comprises a 
sequence with at least 50% identity to the TIM-Barrel 
domain of CARD-X (residues 56-331 of SEQ ID NO: 8) . 
Preferably, a TIM-barrel domain of the invention NAC 
comprises a sequence with at least 60% identity to the 
TIM-Barrel domain of CARD-X. More preferably, a 
TIM-barrel domain of the invention CARD-X comprises a 
sequence with at least 70% identity to the TIM-barrel 
domain of CARD-X. Most preferably, a TIM-barrel domain 
of the CARD-X comprises a sequence with at least 80% 
identity to the TIM-barrel domain of CARD-X. 

In one embodiment, invention chimeric 
CARD-containing proteins provided herein are designated 
NAC-X. Nucleic acids that encode NAC-X are also provided 
herein. Alternative isoforms of the NAC-X proteins and 
the corresponding nucleic acids that encode the 
alternative isoforms are also provided. As used herein, 
the term "NAC-X" refers to chimeric proteins comprising 
portions of a NAC and portions of CARD-X. For example, 
one type of NAC-X protein is a NACS-X, wherein a portion 
of NAC8, for example, the TIM-Barrel-like domain of NAC5, 
is replaced by a portion of CARD-X, for example, the 
TIM-Barrel-like domain of CARD-X. It is within the scope 
of this invention that a protein comprising portions of a 
domain common to both NAC and CARD-X, particularly the 
CARD and TIM-Barrel-like domains, can comprise a chimera 
of NAC and CARD-X. For example, a NAC0-X protein can 
have residues 1-1397 from SEQ ID NO: 2 immediately 
followed by residues 364-402 from SEQ ID NO: 8, which are 
in turn immediately followed by residues 1436-1473 from 
SEQ ID NO: 2, thus forming a chimeric CARD domain. 



In one embodiment, a NAC-X protein will comprise an 
NB-ARC domain of NAC, as previously described, and the 
CARD domain of CARD-X. In another embodiment, a NAC-X 
protein will comprise the NB-ARC domain and LRR domain of 
NAC, the CARD domain of CARD-X, and the TIM-Barrel-like 
domain from either NAC or CARD-X or a chimera from both. 
In yet another embodiment, NAC-X will comprise the NB-ARC 
and LRR domains of NAC and the CARD and TIM-Barrel-like 
domains of CARD-X. For example, invention chimeric 
proteins can include residues between 1-947 and 1-1078 of 
NACp (SEQ ID NO: 2) or between 1-918 and 1-1048 of NACy or 
NAC 5 (SEQ ID NOs:4 and 6, respectively) in chimera with 
residues between 1-431 and 56-431 of CARD-X (SEQ ID 
NO: 8). A particular invention chimera is termed NAC-X1 a 
protein, and comprises the following sequences: NACp-Xl, 
residues 1-1078 of NACp and residues 56-431 of CARD-X, 
having the resultant amino acid sequence listed in SEQ ID 
NO: 10; NACy/5-Xl residues 1-1048 of NACy or NAC5 and 
residues 56-431 of CARD-X, having the resultant amino 
acid sequence listed in SEQ ID NO: 12. The cDNA encoding 
NACP-X1 comprises cDNA residues 1-3234 of NACP and 
166-1293 of CARD-X, having the resultant sequence listed 
in SEQ ID NO: 9; and the cDNA encoding NACy/5-Xl proteins 
comprise cDNA residues 1-3144 of NACy or NAC5 and 166-1293 
of CARD-X, having the resultant sequence listed in SEQ ID 
NO : 1 1 . 

Another embodiment of the invention provides NAC, or 
a functional fragment thereof, fused with a moiety to 
form a conjugate. As used herein, a "moiety" can be a 
physical, chemical or biological entity which contributes 



functionality to NAC or a functional fragment thereof. 
Functionalities contributed by a moiety include 
therapeutic or other biological activity, or the ability 
to facilitate identification or recovery of NAC. 
Therefore, a moiety will include molecules known in the 
art to be useful for detection of the conjugate by, for 
example, by fluorescence, magnetic imaging, detection of 
radioactive emission. A moiety may also be useful for 
recovery of the conjugate, for example a His tag or other 
known tags used for protein isolation/purification, or a 
physical substance such as a bead. A moiety can be a 
therapeutic compound, for example, a cytotoxic drug which 
can be useful to effect a biological change in cells to 
which the conjugate localizes. 

An example of the means for preparing the invention 
polypeptide (s) is to express nucleic acids encoding the 
NAC in a suitable host cell, such as a bacterial cell, a 
yeast cell, an amphibian cell (i.e., oocyte), or a 
mammalian cell, using methods well known in the art, and 
recovering the expressed polypeptide, again using 
well-known methods. Invention polypeptides can be 
isolated directly from cells that have been transformed 
with expression vectors as described below herein. The 
invention polypeptide, biologically functional fragments, 
and functional equivalents thereof can also be produced 
by chemical synthesis. For example, synthetic 
polypeptides can be produced using Applied Biosystems, 
Inc. Model 430A or 431A automatic peptide synthesizer 
(Foster City, CA) employing the chemistry provided by the 
manufacturer . 

Also encompassed by the term NAC are functional 
fragments or polypeptide analogs thereof. The term 
"functional fragment" refers to a peptide fragment that 



is a portion of a full length NAC protein, provided that 
the portion has one or more biological activities, as 
defined above, that is characteristic of the 
corresponding full length NAC. For example, a functional 
fragment of an invention NAC protein can have one or more 
of the protein:protein binding activities prevalent in 
NAC. In addition, the characteristic of a functional 
fragment of invention NAC proteins to elicit an immune 
response is useful for obtaining an anti-NAC antibodies. 
Thus, the invention also provides functional fragments of 
invention NAC proteins, which can be identified using the 
binding and routine methods, such as bioassays described 
herein. 

The term "polypeptide analog" includes any 
polypeptide having an amino acid residue sequence 
substantially the same as a sequence specifically shown 
herein in which one or more residues have been 
conservatively substituted with a functionally similar 
residue and which displays the ability to functionally 
mimic an NAC as described herein. Examples of 
conservative substitutions include the substitution of 
one non-polar (hydrophobic) residue such as isoleucine, 
valine, leucine or methionine for another, the 
substitution of one polar (hydrophilic) residue for 
another such as between arginine and lysine, between 
glutamine and asparagine, between glycine and serine, the 
substitution of one basic residue such as lysine, 
arginine or histidine for another, or the substitution of 
one acidic residue, such as aspartic acid or glutamic 
acid for another . 

The amino acid length of functional fragments or 
polypeptide analogs of the present invention can range 
from about 5 amino acids up to the full-length protein 



sequence of an invention NAC. In certain embodiments, 
the amino acid lengths include, for example, at least 
about 10 amino acids, at least about 20, at least about 
30, at least about 40, at least about 50, at least about 
75, at least about 100, at least about 150, at least 
about 200, at least about 250 or more amino acids in 
length up to the full-length NAC protein sequence. 

As used herein the phrase "conservative 
substitution" also includes the use of a chemically 
derivatized residue in place of a non-derivatized 
residue, provided that such polypeptide displays the 
required binding activity. The phrase "chemical 
derivative" refers to a subject polypeptide having one or 
more residues chemically derivatized by reaction of a 
functional side group. Such derivatized molecules 
include, for example, those molecules in which free amino 
groups have been derivatized to form amine 
hydrochlorides, p-toluene sulfonyl groups, carbobenzoxy 
groups, t-butyloxycarbonyl groups, chloroacetyl groups or 
formyl groups. Free carboxyl groups may be derivatized 
to form salts, methyl and ethyl esters or other types of 
esters or hydrazides. Free hydroxyl groups may be 
derivatized to form O-acyl or O-alkyl derivatives. The 
imidazole nitrogen of histidine may be derivatized to 
form N-im-benzylhistidine. Also included as chemical 
derivatives are those peptides which contain one or more 
naturally occurring amino acid derivatives of the twenty 
standard amino acids. For examples: 4-hydroxyproline 
may be substituted for proline; 5-hydroxylysine may be 
substituted for lysine; 3-methylhistidine may be 
substituted for histidine; homoserine may be substituted 
for serine; and ornithine may be substituted for lysine. 
Polypeptides of the present invention also include any 
polypeptide having one or more additions and/or deletions 
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of residues, relative to the sequence of a polypeptide 
whose sequence is shown herein, so long as the required 
activity is maintained. 

The present invention also provides compositions 
containing an acceptable carrier and any of an isolated, 
purified NAC mature protein or functional polypeptide 
fragments thereof, alone or in combination with each 
other. These polypeptides or proteins can be 
recombinantly derived, chemically synthesized or purified 
from native sources. As used herein, the term 
"acceptable carrier" encompasses any of the standard 
pharmaceutical carriers, such as phosphate buffered 
saline solution, water and emulsions such as an oil/water 
or water/oil emulsion, and various types of wetting 
agents. The NAC compositions described herein can be 
used, for example, in methods described hereinafter. 

In accordance with another embodiment of the 
invention, substantially pure nucleic acid molecules, and 
functional fragments thereof, are provided, which encode 
invention NACs . Exemplary invention nucleic acid 
molecules are those comprising substantially the same 
nucleotide sequence encoding NAC|3 (SEQ ID NO: 1), NACy 
(SEQ ID NO: 3), and NAC5 (SEQ ID NO: 5) . 

The nucleic acid molecules described herein are 
useful for producing invention proteins, when such 
nucleic acids are incorporated into a variety of protein 
expression systems known to those of skill in the art. 
In addition, such nucleic acid molecules or fragments 
thereof can be labeled with a readily detectable 
substituent and used as hybridization probes for assaying 
for the presence and/or amount of an invention NAC gene 
or mRNA transcript in a given sample. The nucleic acid 
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molecules described herein, and fragments thereof, are 
also useful as primers and/or templates in a PCR reaction 
for amplifying genes encoding invention proteins 
described herein. 

The term "nucleic acid" (also referred to as 
polynucleotides) encompasses ribonucleic acid (RNA) or 
deoxyribonucleic acid (DNA) , probes, oligonucleotides, 
and primers. DNA can be either complementary DNA (cDNA) 
or genomic DNA, e.g. a gene encoding a NAC . One means of 
isolating a nucleic acid encoding an NAC polypeptide is 
to probe a mammalian genomic library with a natural or 
artificially designed DNA probe using methods well known 
in the art. DNA probes derived from the NAC gene are 
particularly useful for this purpose. DNA and cDNA 
molecules that encode NAC polypeptides can be used to 
obtain complementary genomic DNA, cDNA or RNA from 
mammalian (e.g., human, mouse, rat, rabbit, pig, and the 
like) , or other animal sources, or to isolate related 
cDNA or genomic clones by the screening of cDNA or 
genomic libraries, by methods described in more detail 
below. Such nucleic acids may include, but are not 
limited to, nucleic acids comprising substantially the 
same nucleotide sequence as set forth in SEQ ID NOs : 1 
(NAC(3), 3 (NACy) , and 5 (NAC5) . 

Use of the terms "isolated" and/or "purified" and/or 
"substantially purified" in the present specification and 
claims as a modifier of DNA, RNA, polypeptides or 
proteins means that the DNA, RNA, polypeptides or 
proteins so designated have been produced in such form by 
the hand of man, and thus are separated from their native 
in vivo cellular environment, and are substantially free 
of any other species of nucleic acid or protein. As a 
result of this human intervention, the recombinant DNAs, 
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RNAs, polypeptides and proteins of the invention are 
useful in ways described herein that the DNAs, RNAs, 
polypeptides or proteins as they naturally occur are not. 

Invention NAC proteins and nucleic acids encoding 
such, can be obtained from any species of organism, such 
as prokaryotes, eukaryotes, plants, fungi, vertebrates, 
invertebrates, and the like. A particular species can be 
mammalian, As used herein, "mammalian" refers to a subset 
of species from which an invention NAC is derived, e.g., 
human, rat, mouse, rabbit, monkey, baboon, bovine, 
porcine, ovine, canine, feline, and the like. A 
preferred NAC herein, is human NAC. 

In one embodiment of the present invention, cDNAs 
encoding the invention NACs disclosed herein comprise 
substantially the same nucleotide sequence as the coding 
region set forth in any of SEQ ID NOs :1, 3 and 5. 
Preferred cDNA molecules encoding the invention proteins 
comprise the same nucleotide sequence as the coding 
region set forth in any of SEQ ID NOs :1, 3 and 5. 

As employed herein, the term "substantially the same 
nucleotide sequence" refers to DNA having sufficient 
identity to the reference polynucleotide, such that it 
will hybridize to the reference nucleotide under 
moderately stringent hybridization conditions. In one 
embodiment, DNA having substantially the same nucleotide 
sequence as the reference nucleotide sequence encodes 
substantially the same amino acid sequence as that set 
forth in any of SEQ ID NOs: 2, 4, 6, 10 or 12. In another 
embodiment, DNA having "substantially the same nucleotide 
sequence" as the reference nucleotide sequence has at 
least 60% identity with respect to the reference 
nucleotide sequence. DNA having at least 70%, more 



preferably at least 90%, yet more preferably at least 
95%, identity to the reference nucleotide sequence is 
preferred. 

This invention also encompasses nucleic acids which 
differ from the nucleic acids shown in SEQ ID NOs :1, 3 
and 5, but which have the same phenotype. Phenotypically 
similar nucleic acids are also referred to as 
"functionally equivalent nucleic acids". As used 
herein, the phrase "functionally equivalent nucleic 
acids" encompasses nucleic acids characterized by slight 
and non-consequential sequence variations that will 
function in substantially the same manner to produce the 
same protein product (s) as the nucleic acids disclosed 
herein. In particular, functionally equivalent nucleic 
acids encode polypeptides that are the same as those 
encoded by the nucleic acids disclosed herein or that 
have conservative amino acid variations. For example, 
conservative variations include substitution of a 
non-polar residue with another non-polar residue, or 
substitution of a charged residue with a similarly 
charged residue. These variations include those 
recognized by skilled artisans as those that do not 
substantially alter the tertiary structure of the 
protein . 

Further provided are nucleic acids encoding NAC 
polypeptides that, by virtue of the degeneracy of the 
genetic code, do not necessarily hybridize to the 
invention nucleic acids under specified hybridization 
conditions. Preferred nucleic acids encoding the 
invention NACs are comprised of nucleotides that encode 
substantially the same amino acid sequence as set forth 
in SEQ ID NOs:2, 4, 6, 10 or 12. 



Thus, an exemplary nucleic acid encoding an 
invention NAC may be selected from: 

(a) DNA encoding the amino acid sequence set 
forth in SEQ ID N0s:2, 4, 6, 10 or 12, 

(b) DNA that hybridizes to the DNA of (a) 
under moderately stringent conditions, wherein said 
DNA encodes biologically active NAC, or 

(c) DNA degenerate with respect to (b) wherein 
said DNA encodes biologically active NAC. 

Hybridization refers to the binding of complementary 
strands of nucleic acid (i.e., sense : antisense strands or 
probe : target-DNA) to each other through hydrogen bonds, 
similar to the bonds that naturally occur in chromosomal 
DNA. Stringency levels used to hybridize a given probe 
with target-DNA can be readily varied by those of skill 
in the art . 

The phrase "stringent hybridization" is used herein 
to refer to conditions under which polynucleic acid 
hybrids are stable. As known to those of skill in the 
art, the stability of hybrids is reflected in the melting 
temperature (TJ of the hybrids. In general, the 
stability of a hybrid is a function of sodium ion 
concentration and temperature. Typically, the 
hybridization reaction is performed under conditions of 
lower stringency, followed by washes of varying, but 
higher, stringency. Reference to hybridization 
stringency relates to such washing conditions. 

As used herein, the phrase "moderately stringent 
hybridization" refers to conditions that permit 
target-DNA to bind a complementary nucleic acid that has 
about 60% identity, preferably about 75% identity, more 
preferably about 85% identity to the target DNA; with 



greater than about 90% identity to target-DNA being 
especially preferred. Preferably, moderately stringent 
conditions are conditions equivalent to hybridization in 
50% formamide, 5X Denhart T s solution, 5X SSPE, 0.2% SDS 
at 42°C, followed by washing in 0.2X SSPE, 0.2% SDS, at 
65°C. 

The phrase "high stringency hybridization" refers to 
conditions that permit hybridization of only those 
nucleic acid sequences that form stable hybrids in 0.018M 
NaCl at 65°C (i.e., if a hybrid is not stable in 0.018M 
NaCl at 65°C, it will not be stable under high stringency 
conditions, as contemplated herein) . High stringency 
conditions can be provided, for example, by hybridization 
in 50% formamide, 5X Denhart T s solution, 5X SSPE, 0.2% 
SDS at 42°C, followed by washing in 0 . IX SSPE, and 0.1% 
SDS at 65°C. 

The phrase "low stringency hybridization" refers to 
conditions equivalent to hybridization in 10% formamide, 
5X Denhart f s solution, 6X SSPE, 0.2% SDS at 42°C, 
followed by washing in IX SSPE, 0.2% SDS, at 50°C. 
Denhart T s solution and SSPE (see, e.g., Sambrook et al., 
Molecular Cloning, A Laboratory Manual , Cold Spring 
Harbor Laboratory Press, (1989)) are well known to those 
of skill in the art as are other suitable hybridization 
buffers . 

As used herein, the term "degenerate" refers to 
codons that differ in at least one nucleotide from a 
reference nucleic acid, e.g., SEQ ID NOs :1, 3 and 5, but 
encode the same amino acids as the reference nucleic 
acid. For example, codons specified by the triplets 
"UCU", "UCC", "UCA", and "UCG" are degenerate with 



respect to each other since all four of these codons 
encode the amino acid serine. 

Preferred nucleic acids encoding the invention 
polypeptide (s) hybridize under moderately stringent, 
preferably high stringency, conditions to substantially 
the entire sequence, or substantial portions (i.e., 
typically at least 15-30 nucleotides) of the nucleic acid 
sequence set forth in SEQ ID NOs :1, 3 and 5. 

The invention nucleic acids can be produced by a 
variety of methods well-known in the art, e.g., the 
methods described herein, employing PCR amplification 
using oligonucleotide primers from various regions of SEQ 
ID NOs :1, 3 and 5, and the like. 

In accordance with a further embodiment of the 
present invention, optionally labeled NAC-encoding cDNAs, 
or fragments thereof, can be employed to probe 
library (ies) (e.g., cDNA, genomic, and the like) for 
additional nucleic acid sequences encoding novel NACs. 
Construction of suitable mammalian cDNA libraries, 
including mammalian cDNA libraries, is well-known in the 
art. Screening of such a cDNA library is initially 
carried out under low-stringency conditions, which 
comprise a temperature of less than about 42°C, a 
formamide concentration of less than about 50%, and a 
moderate to low salt concentration. 

Presently preferred probe-based screening conditions 
comprise a temperature of about 37°C, a formamide 
concentration of about 20%, and a salt concentration of 
about 5X .standard saline citrate (SSC; 20X SSC contains 
3M sodium chloride, 0 . 3M sodium citrate, pH 7.0). Such 
conditions will allow the identification of sequences 
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which have a substantial degree of similarity with the 
probe sequence, without requiring perfect homology. The 
phrase "substantial similarity" refers to sequences which 
share at least 50% homology. Preferably, hybridization 
5 conditions will be selected which allow the 

identification of sequences having at least 70% homology 
with the probe, while discriminating against sequences 
which have a lower degree of homology with the probe. As 
a result, nucleic acids having substantially the same 
10 nucleotide sequence as SEQ ID NOs :1, 3 and 5 are 
obtained. 

As used herein, a nucleic acid "probe" is 
single-stranded DNA or RNA, or analogs thereof, that has 

15 a sequence of nucleotides that includes at least 15, at 
least 20, at least 50, at least 100, at least 200, at 
least 300, at least 400, or at least 500 contiguous bases 
that are the same as (or the complement of) any 
contiguous bases set forth in any of SEQ ID NOs :1, 3 and 

20 5. Preferred regions from which to construct probes 

include 5 1 and/or 3' coding regions of SEQ ID NOs :1, 3 
and 5. In addition, the entire cDNA encoding region of 
an invention NAC, or the entire sequence corresponding to 
SEQ ID NOs :1, 3 and 5, may be used as a probe. Probes 

25 may be labeled by methods well-known in the art, as 
described hereinafter, and used in various diagnostic 
kits . 

As used herein, the terms "label" and "indicating 
30 means" in their various grammatical forms refer to single 
atoms and molecules that are either directly or 
indirectly involved in the production of a detectable 
signal. Any label or indicating means can be linked to 
invention nucleic acid probes, expressed proteins, 
35 polypeptide fragments, or antibody molecules. These 



atoms or molecules can be used alone or in conjunction 
with additional reagents. Such labels are themselves 
well-known in clinical diagnostic chemistry. 

The labeling means can be a fluorescent labeling 
agent that chemically binds to antibodies or antigens 
without denaturation to form a fluorochrome (dye) that is 
a useful immunof luorescent tracer. A description of 
immunof luorescent analytic techniques is found in DeLuca, 
"Immunofluorescence Analysis" , in Antibody As a Tool , 
Marchalonis et al . , eds . , John Wiley & Sons, Ltd., pp. 
189-231 (1982), which is incorporated herein by 
reference . 

In one embodiment, the indicating group is an 
enzyme, such as horseradish peroxidase (HRP) , glucose 
oxidase, and the like. In another embodiment, 
radioactive elements are employed labeling agents. The 
linking of a label to a substrate, i.e., labeling of 
nucleic acid probes, antibodies, polypeptides, and 
proteins, is well known in the art. For instance, an 
invention antibody can be labeled by metabolic 
incorporation of radiolabeled amino acids provided in the 
culture medium. See, for example, Galfre et al . , Meth. 
Enzymol . , 73:3-46 (1981). Conventional means of protein 
conjugation or coupling by activated functional groups 
are particularly applicable. See, for example, Aurameas 
et al., Scand. J. Immunol. , Vol. 8, Suppl . 7:7-23 (1978), 
Rodwell et al . , Biotech. , 3:889-894 (1984), and U.S. 
Patent No. 4,493,795. 

Also provided are antisense-nucleic acids having a 
sequence capable of binding specifically with full-length 
or any portion of an mRNA that encodes NAC polypeptides 
so as to prevent translation of the mRNA. The 
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antisense-nucleic acid may have a sequence capable of 
binding specifically with any portion of the sequence of 
the cDNA encoding NAC polypeptides. As used herein, the 
phrase "binding specifically" encompasses the ability of 
a nucleic acid sequence to recognize a complementary 
nucleic acid sequence and to form double-helical segments 
therewith via the formation of hydrogen bonds between the 
complementary base pairs. An example of an 
antisense-nucleic acid is an antisense-nucleic acid 
comprising chemical analogs of nucleotides. 

Compositions comprising an amount of the 
antisense-nucleic acid, described above, effective to 
reduce expression of NAC polypeptides by passing through 
a cell membrane and binding specifically with mRNA 
encoding NAC polypeptides so as to prevent translation 
and an acceptable hydrophobic carrier capable of passing 
through a cell membrane are also provided herein. 
Suitable hydrophobic carriers are described, for example, 
in U.S. Patent Nos. 5,334,761; 4,889,953; 4,897,355, and 
the like. The acceptable hydrophobic carrier capable of 
passing through cell membranes may also comprise a 
structure which binds to a receptor specific for a 
selected cell type and is thereby taken up by cells of 
the selected cell type. The structure may be part of a 
protein known to bind to a cell-type specific receptor. 

Antisense-nucleic acid compositions are useful to 
inhibit translation of mRNA encoding invention 
polypeptides. Synthetic oligonucleotides, or other 
antisense chemical structures are designed to bind to 
mRNA encoding NAC polypeptides and inhibit translation of 
mRNA and are useful as compositions to inhibit expression 
of NAC associated genes in a tissue sample or in a 
subj ect . 
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In accordance with another embodiment of the 
invention, kits are provided for detecting mutations, 
duplications, deletions, rearrangements and aneuploidies 
in NAC genes comprising at least one invention probe or 
antisense nucleotide . 

The present invention provides means to modulate 
levels of expression of NAC polypeptides by employing 
synthetic antisense-nucleic acid compositions 
(hereinafter SANC) which inhibit translation of mRNA 
encoding these polypeptides. Synthetic oligonucleotides, 
or other antisense-nucleic acid chemical structures 
designed to recognize and selectively bind to mRNA, are 
constructed to be complementary to full-length or 
portions of an NAC coding strand, including nucleotide 
sequences set forth in SEQ ID NOs :1, 3 and 5 . The SANC 
is designed to be stable in the blood stream for 
administration to a subject by injection, or in 
laboratory cell culture conditions. The SANC is designed 
to be capable of passing through the cell membrane in 
order to enter the cytoplasm of the cell by virtue of 
physical and chemical properties of the SANC which render 
it capable of passing through cell membranes, for 
example, by designing small, hydrophobic SANC chemical 
structures, or by virtue of specific transport systems in 
the cell which recognize and transport the SANC into the 
cell. In addition, the SANC can be designed for 
administration only to certain selected cell populations 
by targeting the SANC to be recognized by specific 
cellular uptake mechanisms which bind and take up the 
SANC only within select cell populations. In a 
particular embodiment the SANC is an antisense 
oligonucleotide . 



For example, the SANC may be designed to bind to a 
receptor found only in a certain cell type, as discussed 
supra. The SANC is also designed to recognize and 
selectively bind to target mRNA sequence, which may 
correspond to a sequence contained within the sequences 
shown in SEQ ID NOs :1, 3 and 5. The SANC is designed to 
inactivate target mRNA sequence by either binding thereto 
and inducing degradation of the mRNA by, for example, 
RNase I digestion, or inhibiting translation of mRNA 
target sequence by interfering with the binding of 
translation-regulating factors or ribosomes, or inclusion 
of other chemical structures, such as ribozyme sequences 
or reactive chemical groups which either degrade or 
chemically modify the target mRNA. SANCs have been shown 
to be capable of such properties when directed against 
mRNA targets (see Cohen et al . , TIPS , 10:435 (1989) and 
Weintraub, Sci. American , January (1990), pp.40; both 
incorporated herein by reference) . 

In accordance with yet another embodiment of the 
present invention, there is provided a method for the 
recombinant production of invention NAC by expressing the 
above-described nucleic acid sequences in suitable host 
cells. Recombinant DNA expression systems that are 
suitable to produce NAC described herein are well-known 
in the art. For example, the above-described nucleotide 
sequences can be incorporated into vectors for further 
manipulation. As used herein, vector (or plasmid) refers 
to discrete elements that are used to introduce 
heterologous DNA into cells for either expression or 
replication thereof. 

Suitable expression vectors are well-known in the 
art, and include vectors capable of expressing DNA 
operatively linked to a regulatory sequence, such as a 
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promoter region that is capable of regulating expression 
of such DNA. Thus, an expression vector refers to a 
recombinant DNA or RNA construct, such as a plasmid, a 
phage, recombinant virus or other vector that, upon 
5 introduction into an appropriate host cell, results in 
expression of the inserted DNA. Appropriate expression 
vectors are well known to those of skill in the art and 
include those that are replicable in eukaryotic cells 
and/or prokaryotic cells and those that remain episomal 
10 or those which integrate into the host cell genome. 

Prokaryotic transformation vectors are well-known in 
the art and include pBlueskript and phage Lambda ZAP 
vectors (Stratagene, La Jolla, CA) , and the like. Other 
15 suitable vectors and promoters are disclosed in detail in 
U.S. Patent No. 4,798,885, issued January 17, 1989, the 
disclosure of which is incorporated herein by reference 
in its entirety. 

Other suitable vectors for transformation of E. coli 
cells include the pET expression vectors (Novagen, see 
U.S. patent 4,952,496), e.g., pETlla, which contains the 
T7 promoter, T7 terminator, the inducible E. coli lac 
operator, and the lac repressor gene; and pET 12a-c, 
which contain the T7 promoter, T7 terminator, and the E. 
coli ompT secretion signal. Another suitable vector is 
the pIN-lIIompA2 (see Duffaud et al . , Meth. in 
Enzymology , 153:492-507, 1987), which contains the lpp 
promoter, the lacUV5 promoter operator, the ompA 
secretion signal, and the lac repressor gene. 

In accordance with another embodiment of the present 
invention, there are provided "recombinant cells" 
containing the nucleic acid molecules (i.e., DNA or mRNA) 
35 of the present invention. Methods of transforming 



suitable host cells, preferably bacterial cells, and more 
preferably E. coli cells, as well as methods applicable 
for culturing said cells containing a gene encoding a 
heterologous protein, are generally known in the art. 
See, for example, Sambrook et al . , Molecular Cloning : A 
Laboratory Manual (2 edj, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, New York, USA (1989) . 

Exemplary methods of transformation include, e.g., 
transformation employing plasmids, viral, or bacterial 
phage vectors, transf ection, electroporation, 
lipofection, and the like. The heterologous DNA can 
optionally include sequences which allow for its 
extrachromosomal maintenance, or said heterologous DNA 
can be caused to integrate into the genome of the host 
(as an alternative means to ensure stable maintenance in 
the host) . 

Host organisms contemplated for use in the practice 
of the present invention include those organisms in which 
recombinant production of heterologous proteins has been 
carried out. Examples of such host organisms include 
bacteria (e.g., E. coli), yeast (e.g., Saccharomyces 
cerevisiae r Candida tropicalis, Hansenula polymorpha and 
P. pastoris; see, e.g., U.S. Patent Nos . 4,882,279, 
4,837,148, 4,929,555 and 4,855,231), mammalian cells 
(e.g., HEK293, CHO and Ltk" cells), insect cells, and the 
like. Presently preferred host organisms are bacteria. 
The most preferred bacteria is E. coli. 

In one embodiment, nucleic acids encoding the 
invention NAC can be delivered into mammalian cells, 
either in vivo or in vitro using suitable viral vectors 
well-known in the art. Suitable retroviral vectors, 
designed specifically for "gene therapy" methods, are 



described, for example, in WIPO publications WO 9205266 
and WO 9214829, which provide a description of methods 
for efficiently introducing nucleic acids into human 
cells. In addition, where it is desirable to limit or 
reduce the In vivo expression of the invention NAC, the 
introduction of the antisense strand of the invention 
nucleic acid is contemplated. 

For example, in one embodiment of the present 
invention, adenovirus-transf err in/poly lysine-DNA 
(TfAdpl-DNA) vector complexes (Wagner et al., Proc. Natl. 
Acad. Sci., USA , 89:6099-6103 (1992); Curiel et al . , Hum. 
Gene Ther . , 3:147-154 (1992); Gao et al . , Hum. Gene 
Ther . , 4:14-24 (1993)) are employed to transduce 
mammalian cells with heterologous NAC nucleic acid. Any 
of the plasmid expression vectors described herein may be 
employed in a TfAdpl-DNA complex. 

In accordance with yet another embodiment of the 
present invention, there are provided anti-NAC antibodies 
having specific reactivity with an NAC polypeptides of 
the present invention. The present invention also 
provides anti-NACP, anti-NACy, anti-NAC5, anti-NACp-Xl , or 
anti-NACy/5-Xl antibodies. It should be recognized that 
an antibody of the invention can be specific for an 
epitope that is present only in a particular type of NAC 
or can be specific for an epitope that is common to more 
than one type of NAC. For example, an anti-NACS antibody 
can be specific for only NAC5 or for more than one member 
of the NAC family. As used herein, the term "antibody" 
is used in its broadest sense to include polyclonal and 
monoclonal antibodies, as well as polypeptide fragments 
of antibodies that retain a specific binding activity for 
a specific antigen of at least about 1 x 105 M-l. One 



skilled in the art would know that, for example, 
anti-NACp antibody fragments or anti-NACy antibody 
fragments such as Fab, F(ab T )2, Fv and Fd fragments can 
retain specific binding activity for a NACP or a NACy, 
respectively, and, thus, are included within the 
definition of an antibody. In addition, the term 
"antibody" as used herein includes naturally occurring 
antibodies as well as non-naturally occurring antibodies 
and fragments of antibodies that retain binding activity. 
Such non-naturally occurring antibodies can be 
constructed using solid phase peptide synthesis, can be 
produced recombinantly or can be obtained, for example, 
by screening combinatorial libraries consisting of 
variable heavy chains and variable light chains as 
described by Huse et al . , Science 246:1275-1281 (1989), 
which is incorporated herein by reference. 

Invention antibodies can be produced by methods 
known in the art using invention polypeptides, proteins 
or portions thereof as antigens. For example, polyclonal 
and monoclonal antibodies can be produced by methods well 
known in the art, as described, for example, in Harlow 
and Lane, Antibodies: A Laboratory Manual (Cold Spring 
Harbor Laboratory (1988)), which is incorporated herein 
by reference. Invention polypeptides can be used as 
immunogens in generating such antibodies. Alternatively, 
synthetic peptides can be prepared (using commercially 
available synthesizers) and used as immunogens. Amino 
acid sequences can be analyzed by methods well known in 
the art to determine whether they encode hydrophobic or 
hydrophilic domains of the corresponding polypeptide. 
Altered antibodies such as chimeric, humanized, 
CDR-grafted or bifunctional antibodies can also be 
produced by methods well known in the art. Such 
antibodies can also be produced by hybridoma, chemical 



synthesis or recombinant methods described, for example, 
in Sambrook et al . , supra . , and Harlow and Lane, supra . 
Both anti-peptide and anti-fusion protein antibodies can 
be used, (see, for example, Bahouth et al . , Trends 
Pharmacol. Sci . 12:338 (1991); Ausubel et al . , Current 
Protocols in Molecular Biology (John Wiley and Sons, NY 
(1989) which are incorporated herein by reference) . 

In the case of monoclonal antibodies specific to 
NAC, it is also contemplated herein that the invention 
includes hybridomas and any other type of cell line which 
produces a monoclonal antibody. Methods of preparing 
hybridomas are described for example, in Sambrook et al., 
supra . , and Harlow and Lane, supra ; and preparation of 
any non-hybridoma cell line producing a monoclonal 
antibody specific to NAC can be carried out in accordance 
with the methods known in the art and methods described 
herein for protein expression in cells such as bacterial 
cells, yeast cells, amphibian cells, mammalian cells, and 
the like. 

Antibody so produced can be used, inter alia , in 
diagnostic methods and systems to detect the level of NAC 
present in a mammalian, preferably human, body sample, 
such as tissue or vascular fluid. Such antibodies can 
also be used for the immunoaf f inity or affinity 
chromatography purification of the invention NAC. In 
addition, methods are contemplated herein for detecting 
the presence of an invention NAC protein in a tissue or 
cell, comprising contacting the cell with an antibody 
that specifically binds to NAC polypeptides, under 
conditions permitting binding of the antibody to the NAC 
polypeptides, detecting the presence of the antibody 
bound to the NAC polypeptide, and thereby detecting the 
presence of invention polypeptides. With respect to the 



detection of such polypeptides, the antibodies can be 
used for in vitro diagnostic or in vivo imaging methods. 

Immunological procedures useful for in vitro 
detection of target NAC polypeptides in a sample include 
immunoassays that employ a detectable antibody. Such 
immunoassays include, for example, ELISA, Pandex 
microf luorimetric assay, agglutination assays, flow 
cytometry, serum diagnostic assays and 

immunohistochemical staining procedures which are well 
known in the art. An antibody can be made detectable by 
various means well known in the art. For example, a 
detectable marker can be directly or indirectly attached 
to the antibody. Useful markers include, for example, 
radionucleotides, enzymes, fluorogens, chromogens and 
chemiluminescent labels . 

Invention anti-NAC antibodies are contemplated for 
use herein to modulate the activity of the NAC 
polypeptide in living animals, in humans, or in 
biological tissues or fluids isolated therefrom. The 
term "modulate" refers to a compound's ability to 
increase (e.g., via an agonist) or inhibit (e.g., via an 
antagonist) the biological activity of an invention NAC 
protein, such as the capablity of binding CARD-containing 
proteins, NB-ARC-containing proteins, to modulate the 
activity of proteases such as caspases, to modulate the 
activity of NF-kB, and to modulate apoptosis. 
Accordingly, compositions comprising a carrier and an 
amount of an antibody having specificity for NAC 
polypeptides effective to inhibit naturally occurring 
ligands or NAPs from binding to invention NAC 
polypeptides are contemplated herein. For example, a 
monoclonal antibody directed to an epitope of an 
invention NAC polypeptide including an amino acid 



sequence set forth in SEQ ID N0s:2, 4, 6, 10 or 12, can 
be useful for this purpose. 

The present invention further provides transgenic 
non-human mammals that are capable of expressing 
exogenous nucleic acids encoding NAC polypeptides. As 
employed herein, the phrase "exogenous nucleic acid" 
refers to nucleic acid sequence which is not native to 
the host, or which is present in the host in other than 
its native environment (e.g., as part of a genetically 
engineered DNA construct) . In addition to naturally 
occurring levels of NAC, invention NAC can either be 
overexpressed or underexpressed (such as in the 
well-known knock-out transgenics) in transgenic mammals. 

Also provided are transgenic non-human mammals 
capable of expressing nucleic acids encoding NAC 
polypeptides so mutated as to be incapable of normal 
activity, i.e., do not express native NAC. The present 
invention also provides transgenic non-human mammals 
having a genome comprising antisense nucleic acids 
complementary to nucleic acids encoding NAC polypeptides 
placed so as to be transcribed into antisense mRNA 
complementary to mRNA encoding NAC polypeptides, which 
hybridizes to the mRNA and, thereby, reduces the 
translation thereof. The nucleic acid may additionally 
comprise an inducible promoter and/or tissue specific 
regulatory elements, so that expression can be induced, 
or restricted to specific cell types. Examples of 
nucleic acids are DNA or cDNA having a coding sequence 
substantially the same as the coding sequence shown in 
SEQ ID NOs :1, 3 or 5. An example of a non-human 
transgenic mammal is a transgenic mouse. Examples of 
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tissue specificity-determining elements are the 
metallothionein promoter and the L7 promoter. 

Animal model systems which elucidate the 
physiological and behavioral roles of NAC polypeptides 
are also provided, and are produced by creating 
transgenic animals in which the expression of the NAC 
polypeptide is altered using a variety of techniques. 
Examples of such techniques include the insertion of 
normal or mutant versions of nucleic acids encoding an 
NAC polypeptide by microinj ection, retroviral infection 
or other means well known to those skilled in the art, 
into appropriate fertilized embryos to produce a 
transgenic animal. (See, for example, Hogan et al . , 
Manipulating the Mouse Embryo: A Laboratory Manual (Cold 
Spring Harbor Laboratory, (1986) ) . 

Also contemplated herein, is the use of homologous 
recombination of mutant or normal versions of NAC genes 
with the native gene locus in transgenic animals, to 
alter the regulation of expression or the structure of 
NAC polypeptides (see, Capecchi et al . , Science 244:1288 
(1989); Zimmer et al . , Nature 338:150 (1989); which are 
incorporated herein by reference) . Homologous 
recombination techniques are well known in the art. 
Homologous recombination replaces the native (endogenous) 
gene with a recombinant or mutated gene to produce an 
animal that cannot express native (endogenous) protein 
but can express, for example, a mutated protein which 
results in altered expression of NAC polypeptides. 

In contrast to homologous recombination, 
microinjection adds genes to the host genome, without 
removing host genes. Microinjection can produce a 
transgenic animal that is capable of expressing both 
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endogenous and exogenous NAC. Inducible promoters can be 
linked to the coding region of nucleic acids to provide a 
means to regulate expression of the transgene. Tissue 
specific regulatory elements can be linked to the coding 
5 region to permit tissue-specific expression of the 

transgene. Transgenic animal model systems are useful 
for in vivo screening of compounds for identification of 
specific ligands, i.e., agonists and antagonists, which 
activate or inhibit NAC protein responses. 

10 

A further embodiment of the invention provides a 
method to identify agents that can effectively alter NAC 
activity, for example the ability of NAC to association 
with one or more heterologous proteins. Thus, the 

15 present invention provides a screening assay useful for 
identifying an effective agent, which can alter the 
association of a NAC with a NAC associated protein, such 
as a CARD-containing protein and/or an NB-ARC-containing 
protein. Since CARD-containing proteins and NB-ARC- 

20 containing proteins are involved in apoptosis, the 

identification of such effective agents can be useful for 
modulating the level of apoptosis in a cell in a subject 
having a pathology characterized by an increased or 
decreased level of apoptosis. 

25 

Further, since invention NAC proteins comprise CARD 
domains, effective agents can be useful for modulation of 
any other CARD domain activity. These additional CARD 
domain activities include, for example, NF-kB activity 
30 modulation, cytokine receptor signal transduction, and 
caspase activation/inhibition, regardless of whether the 
effected caspase is involved in apoptosis or some 
alternative cellular process such as proteolytic 
processing and activation of inflammatory cytokines. 
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As used herein, the term "agent" means a chemical or 
biological molecule such as a simple or complex organic 
molecule, a peptide, a peptido-mimetic, a protein or an 
oligonucleotide that has the potential for altering the 
association of NAC with a heterologous protein or 
altering the ability of NAC to self-associate or altering 
the nucleotide binding and/or hydrolysis activity of NAC. 
In addition, the term "effective agent" is used herein to 
mean an agent that can, in fact, alter the association of 
NAC with a heterologous protein or altering the ability 
of NAC to self-associate or altering the nucleotide 
binding and/or hydrolysis activity of NAC. For example, 
an effective agent may be an anti-NAC antibody or a NAC- 
associated-protein . 

As used herein, the term "alter the association" 
means that the association between two specifically 
interacting proteins either is increased or is decreased 
due to the presence of an effective agent. As a result 
of an altered association of NAC with another protein in 
a cell, the activity of the NAC or the NAC associated 
protein can be increased or decreased, thereby modulating 
a biological process, for example, the level of apoptosis 
in the cell. As used herein, the term "alter the 
activity" means that the agent can increase or decrease 
the activity of a NAC in a cell, thereby modulating a 
biological process in a cell, for example, the level of 
apoptosis in the cell. For example, an effective agent 
can increase or decrease the NB-ARC : NB-ARC-associating 
activity of a NAC, without affecting the association of 
the NAC with a CARD-containing protein. Modulation of 
the ATP hydrolysis activity can modulate the ability of 
NAC proteins to associate with other NB-ARC-containing 
proteins, such as Apaf-1, thereby modulating any process 
effected by such association between NAC and an 
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NB-ARC-containing protein. Similarly, the term "alters 
the association" of NAC with another protein refers to 
increasing or decreasing, or otherwise changing the 
association between a NAC and a protein that specifically 
binds to NAC (i.e., a NAC associated protein). 

An effective agent can act by interfering with the 
ability of a NAC to associate with another protein, or 
can act by causing the dissociation of NAC from a complex 
with a NAC-associated protein, wherein the ratio of bound 
NAC to free NAC is related to the level of a biological 
process, for example, apoptosis, in a cell. For example, 
binding of a ligand to a NAC-associated protein can allow 
the NAC-associated protein, in turn, to bind a NAC. The 
association, for example, of a CARD-containing protein 
and a NAC can result in activation or inhibition of the 
NB-ARC : NB-ARC-associating activity of NAC. In the 
presence of an effective agent, the association of a NAC 
and a CARD-containing protein can be altered, which can 
thereby alter the activation of caspases in the cell. As 
a result of the altered caspase activation, the level of 
apoptosis in a cell can be increased or decreased. Thus, 
the identification of an effective agent that alters the 
association of NAC with another protein can allow for the 
use of the effective agent to increase or decrease the 
level of apoptosis in a cell. 

An effective agent can be useful, for example, to 
increase the level of apoptosis in a cell such as a 
cancer cell, which is characterized by having a decreased 
level of apoptosis as compared to its normal cell 
counterpart. An effective agent also can be useful, for 
example, to decrease the level of apoptosis in a cell 
such as a T lymphocyte in a subject having a viral 
disease such as acquired immunodeficiency syndrome, which 
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is characterized by an increased level of apoptosis in an 
infected T cell as compared to a normal T cell. Thus, an 
effective agent can be useful as a medicament for 
altering the level of apoptosis in a subject having a 
pathology characterized by increased or decreased 
apoptosis. In addition, an effective agent can be used, 
for example, to decrease the level of apoptosis and, 
therefore, increase the survival time of a cell such as a 
hybridoma cell in culture. The use of an effective agent 
to prolong the survival of a cell in vitro can 
significantly improve bioproduction yields in industrial 
tissue culture applications. 

A NAC that lacks the ability to bind the NB-ARC 
domain of another protein but retains the ability to 
self-associate via its CARD domain or to bind to other 
CARD-containing proteins is an example of an effective 
agent, since the expression of a non-NB-ARC-associating 
NAC in a cell can alter the association of a the 
endogenous NAC protein with itself or with NAC associated 
proteins . 

Thus, it should be recognized that a mutation of a 
NAC can be an effective agent, depending, for example, on 
the normal level of NAC/NAC-associated protein that 
occurs in a particular cell type. In addition, an active 
fragment of a NAC can be an effective agent, provided the 
active fragment can alter the association of NAC and 
another protein in a cell. Such active fragments, which 
can be peptides as small as about five amino acids, can 
be identified, for example, by screening a peptide 
library (see, for example, Ladner et al . , U.S. Patent No: 
5,223,409, which is incorporated herein by reference) to 
identify peptides that can bind a NAC-associated protein. 
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Similarly, a peptide or polypeptide portion of a 
NAC-associated protein also can be an effective agent. A 
peptide such as the C-terminal peptide of NAC-associated 
protein can be useful, for example, for decreasing the 
5 association of NAC with a CARD-containing protein or a 
NB-ARC-containing protein in a cell by competing for 
binding to the NAC. A non-naturally occurring 
peptido-mimetic also can be useful as an effective agent. 
Such a peptido-mimetic can include, for example, a 
10 peptoid, which is peptide-like sequence containing 
N-substituted glycines, or an oligocarbamate . A 
peptido-mimetic can be particularly useful as an 
effective agent due, for example, to having an increased 
stability to enzymatic degradation in vivo. 

15 

A screening assay to identify an effective agent can 
be performed in vivo using the two hybrid system or can 
be performed in vitro as disclosed herein. The yeast two 
hybrid system, for example, can be used to screen a panel 

20 of agents to identify effective agents that alter the 
association of NAC with another protein. An effective 
agent can be identified by detecting an altered level of 
transcription of a reporter gene. For example, the level 
of transcription of a reporter gene due to the bridging 

25 of a DNA-binding domain and trans-activation domain by a 
NAP and NAC hybrids can be determined in the absence and 
in the presence of an agent. An effective agent, which 
alters the association between NAC and another protein, 
can be identified by a proportionately altered level of 

30 transcription of the reporter gene as compared to the 
control level of transcription in the absence of the 
agent . 

As understood by those of skill in the art, assay 
35 methods for identifying agents that modulate NAC activity 
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generally require comparison to a control. For example, 
one type of a "control" is a cell or culture that is 
treated substantially the same as the test cell or test 
culture exposed to the agent, with the distinction that 
the "control" cell or culture is not exposed to the 
agent. Another type of "control" cell or culture may be 
a cell or culture that is identical to the transfected 
cells, with the exception that the "control" cell or 
culture do not express native proteins. Accordingly, the 
response of the transfected cell to agent is compared to 
the response (or lack thereof) of the "control" cell or 
culture to the same agent under the same reaction 
conditions. Similarly, a "control" can be the extract, 
partially purified or not, of a cell not exposed to the 
agent or not expressing certain native proteins. A 
"control" may also be an isolated compound, for example, 
a protein (e.g., Skp-1 as used in Examples), which is 
known to not specifically associate with NAC proteins. 

Accordingly, in accordance with another embodiment 
of the present invention, there is provided a method of 
identifying an effective agent that alters the 
association of a NB-ARC and CARD-containing protein (NAC) 
with a NAC associated protein (NAP), by the steps of: 

a. contacting said NAC and NAP proteins, 
under conditions that allow the NAC and NAP proteins 
to associate, with an agent suspected of being able 
to alter the association of the NAC and NAP 
proteins ; and 

b. detecting the altered association of the 
NAC and NAP proteins, wherein the altered 
association identifies an effective agent. 



Methods well-known in the art for detecting the 
altered association of the NAC and NAP proteins, for 
example, measuring protein :protein binding, protein 
degradation or apoptotic activity can be employed in 
bioassays described herein to identify agents as agonists 
or antagonists of NAC proteins. As described herein, NAC 
proteins have the ability to self-associate. Thus, 
methods for identifying effective agents that alter the 
association of a NAC protein NAP will also be useful for 
identifying effective agents that alter the ability of 
NAC to self-associate. Similarly, CARD-X proteins have 
the ability to interact with other CARD-containing 
proteins and to self-associate. Thus, methods for 
identifying effective agents that alter the association 
of a NAC and another protein will also be useful for 
identifying effective agents that alter the ability of 
CARD-X to self-associate or to associate with a 
heterologous CARD-containing protein. 

As used herein, "'conditions that allow said NAC and 
NAP proteins to associate" refers to environmental 
conditions in which NAC : NAP specifically associate. Such 
conditions will typically be aqueous conditions, with a 
pH between 3.0 and 11.0, and temperature below 100°C. 
Preferably, the conditions will be aqueous conditions 
with salt concentrations below the equivalent of 1 M 
NaCl, and pH between 5.0 and 9.0, and temperatures 
between 0°C and 50°C. Most preferably, the conditions 
will range from physiological conditions of normal yeast 
or mammalian cells, or conditions favorable for carrying 
out in vitro assays such as immunoprecipitation and GST- 
NAC : NAP association assays, and the like. 

In yet another embodiment of the present invention, 
there are provided methods for modulating the caspase 



modulating activity mediated by NAC proteins, the method 
comprising : 

contacting an NAC protein with an effective, 
modulating amount of an agonist or antagonist identified 
5 by the above-described bioassays. 

The present invention also provides in vitro 
screening assays. Such screening assays are particularly 
useful in that they can be automated, which allows for 

10 high through-put screening, for example, of randomly or 
rationally designed agents such as drugs, peptidomimetics 
or peptides in order to identify those agents that 
effect ively alter the association of NAC and NAP proteins 
or the activity of a NAC and, thereby, modulate 

15 apoptosis. An in vitro screening assay can utilize, for 
example, a NAC or a NAC fusion protein such as a 
NAC-glutathione-S-transf erase fusion protein (GST /NAC; 
see Examples) . For use in the in vitro screening assay, 
the NAC or NAC fusion protein should have an affinity for 

20 a solid substrate as well as the ability to associate 

with a NAC-associated protein. For example, when a NAC 
is used in the assay, the solid substrate can contain a 
covalently attached anti-NAC antibody. Alternatively, a 
GST/NAC fusion protein can be used in the assay and the 

25 solid substrate can contain covalently attached 

glutathione, which is bound by the GST component of the 
GST/NAC fusion protein. Similarly, a NAC-associated 
protein, or a GST/CARD-containing protein or 
GST/NB-ARC-containing protein fusion protein can be used 

30 in an in vitro assay as described herein. 



An in vitro screening assay can be performed by 
allowing a NAC or NAC-fusion protein, for example, to 
bind to the solid support, then adding a NAC-associated 
35 protein and an agent to be tested. Control reactions, 



which do not contain an agent, can be performed in 
parallel. Following incubation under suitable 
conditions, which include, for example, an appropriate 
buffer concentration and pH and time and temperature that 
permit binding of the particular NAC and NAC-associated 
protein, the amount of protein that has associated in the 
absence of an agent and in the presence of an agent can 
be determined. The association of a NAC-associated 
protein with a NAC protein can be detected, for example, 
by attaching a detectable moiety such as a radionuclide 
or a fluorescent label to a NAC-associated protein and 
measuring the amount of label that is associated with the 
solid support, wherein the amount of label detected 
indicates the amount of association of the NAC-associated 
protein with a NAC protein. An effective agent is 
determined by comparing the amount of specific binding in 
the presence of an agent as compared to the control level 
of binding, wherein an effective agent alters the 
association of NAC with the NAC-assocated protein. Such 
an assay is particularly useful for screening a panel of 
agents such as a peptide library in order to detect an 
effective agent. 

The invention further provides methods for 
introducing a nucleic acid encoding a NAC into a cell in 
a subject, for example, for gene therapy. Viruses are 
specialized infectious agents that can elude host defense 
mechanisms and can infect and propagate in specific cell 
types. Viral based systems provide the advantage of 
being able to introduce relatively high levels of the 
heterologous nucleic acid into a variety of cells. 
Suitable viral vectors for introducing invention nucleic 
acid encoding an NAC protein into mammalian cells (e.g., 
vascular tissue segments) are well known in the art. 
These viral vectors include, for example, Herpes simplex 
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virus vectors (e.g., Geller et al . , Science , 
241:1667-1669 (1988)), Vaccinia virus vectors (e.g., 
Piccini et al., Meth. in Enz ymology , 153:545-563 (1987); 
Cytomegalovirus vectors (Mocarski et al . , in Viral 
Vectors , Y. Gluzman and S.H. Hughes, Eds., Cold Spring 
Harbor Laboratory, Cold Spring Harbor, N.Y., 1988, pp. 
78-84), Moloney murine leukemia virus vectors (Danos et 
al., Proc. Natl. Acad. Sci., USA , 85:6469 (1980)), 
adenovirus vectors (e.g., Logan et al., Proc. Natl. Acad. 
Sci., USA , 81:3655-3659 (1984); Jones et al . , Cell , 
17:683-689 (1979); Berkner, Biotechniques , 6:616-626 
(1988); Cotten et al . , Proc. Natl. Acad. Sci., USA , 
89:6094-6098 (1992); Graham et al . , Meth. Mol. Biol. , 
7:109-127 (1991)), adeno-associated virus vectors, 
retrovirus vectors (see, e.g., U.S. Patent 4, 405,712 and 
4,650,764), and the like. Especially preferred viral 
vectors are the adenovirus and retroviral vectors. 

Suitable retroviral vectors for use herein are 
described, for example, in U.S. Patent 5,252,479, and in 
WIPO publications WO 92/07573, WO 90/06997, WO 89/05345, 
WO 92/05266 and WO 92/14829, incorporated herein by 
reference, which provide a description of methods for 
efficiently introducing nucleic acids into human cells 
using such retroviral vectors. Other retroviral vectors 
include, for example, the mouse mammary tumor virus 
vectors (e.g., Shackleford et al . , Proc. Natl. Acad. Sci. 
USA, 85:9655-9659 (1988)), and the like. 

In particular, the specificity of viral vectors for 
particular cell types can be utilized to target 
predetermined cell types. Thus, the selection of a viral 
vector will depend, in part, on the cell type to be 
targeted. For example, if a neurodegenerative disease is 
to be treated by increasing the level of a NAC in 
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neuronal cells affected by the disease, then a viral 
vector that targets neuronal cells can be used. A vector 
derived from a herpes simplex virus is an example of a 
viral vector that targets neuronal cells (Battleman et 
al., J. Neurosci. 13:941-951 (1993), which is 
incorporated herein by reference) . Similarly, if a 
disease or pathological condition of the hematopoietic 
system is to be treated, then a viral vector that is 
specific for a particular blood cell or its precursor 
cell can be used. A vector based on a human 
immunodeficiency virus is an example of such a viral 
vector (Carroll et al . , J* Cell. Biochem. 17E:241 (1993), 
which is incorporated herein by reference) . In addition, 
a viral vector or other vector can be constructed to 
express a nucleic acid encoding a NAC in a tissue 
specific manner by incorporating a tissue-specific 
promoter or enhancer into the vector (Dai et al., Proc . 
Natl. Acad. Sci. USA 89:10892-10895 (1992), which is 
incorporated herein by reference) . 

For gene therapy, a vector containing a nucleic acid 
encoding a NAC or an antisense nucleotide sequence can be 
administered to a subject by various methods. For 
example, if viral vectors are used, administration can 
take advantage of the target specificity of the vectors. 
In such cases, there in no need to administer the vector 
locally at the diseased site. However, local 
administration can be a particularly effective method of 
administering a nucleic acid encoding a NAC. In 
addition, administration can be via intravenous or 
subcutaneous injection into the subject. Following 
injection, the viral vectors will circulate until they 
recognize host cells with the appropriate target 
specificity for infection. Injection of viral vectors 
into the spinal fluid also can be an effective mode of 
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administration, for example, in 
neurodegenerative disease . 



treating 



a 



Receptor-mediated DNA delivery approaches also can 
5 be used to deliver a nucleic acid molecule encoding a NAC 
into cells in a tissue-specific manner using a 
tissue-specific ligand or an antibody that is 
non-covalently complexed with the nucleic acid molecule 
via a bridging molecule (Curiel et al., Hum. Gene Ther. 

10 3:147-154 (1992); Wu and Wu, J. Biol. Chem. 2 62:4429-4432 
(1987), each of which is incorporated herein by 
reference) . Direct injection of a naked or a nucleic 
acid molecule encapsulated, for example, in cationic 
liposomes also can be used for stable gene transfer into 

15 non-dividing or dividing cells in vivo (Ulmer et al., 
Science 259:1745-174 8 (1993), which is incorporated 
herein by reference) . In addition, a nucleic acid 
molecule encoding a NAC can be transferred into a variety 
of tissues using the particle bombardment method 

20 (Williams et al., Proc. Natl. Acad. Sci. USA 88:2726-2730 
(1991), which is incorporated herein by reference). Such 
nucleic acid molecules can be linked to the appropriate 
nucleotide sequences required for transcription and 
translation . 

25 

A particularly useful mode of administration of a 
nucleic acid encoding a NAC is by direct inoculation 
locally at the site of the disease or pathological 
condition. Local administration can be advantageous 

30 because there is no dilution effect and, therefore, the 
likelihood that a majority of the targeted cells will be 
contacted with the nucleic acid molecule is increased. 
Thus, local inoculation can alleviate the targeting 
requirement necessary with other forms of administration 

35 and, if desired, a vector that infects all cell types in 
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the inoculated area can be used. If expression is 
desired in only a specific subset of cells within the 
inoculated area, then a promotor, an enhancer or other 
expression element specific for the desired subset of 
5 cells can be linked to the nucleic acid molecule. 
Vectors containing such nucleic acid molecules and 
regulatory elements can be viral vectors, viral genomes, 
plasmids, phagemids and the like. Transfection vehicles 
such as liposomes also can be used to introduce a 
10 non-viral vector into recipient cells. Such vehicles are 
well known in the art. 

The present invention also provides therapeutic 
compositions useful for practicing the therapeutic 
methods described herein. Therapeutic compositions of 
the present invention, such as pharmaceutical 
compositions, contain a physiologically compatible 
carrier together with an invention NAC (or functional 
fragment thereof) , a NAC modulating agent, such as a 
compound (agonist or antagonist) identified by the 
methods described herein, or an anti-NAC antibody, as 
described herein, dissolved or dispersed therein as an 
active ingredient. In a preferred embodiment, the 
therapeutic composition is not immunogenic when 
administered to a mammal or human patient for therapeutic 
purposes . 

As used herein, the terms "pharmaceutically 
acceptable", "physiologically compatible" and grammatical 
30 variations thereof, as they refer to compositions, 

carriers, diluents and reagents, are used interchangeably 
and represent that the materials are capable of 
administration to a mammal without the production of 
undesirable physiological effects such as nausea, 
35 dizziness, gastric upset, and the like. 



The preparation of a pharmacological composition 
that contains active ingredients dissolved or dispersed 
therein is well known in the art. Typically such 
compositions are prepared as injectables either as liquid 
solutions or suspensions; however, solid forms suitable 
for solution, or suspension, in liquid prior to use can 
also be prepared. The preparation can also be 
emulsified. 

The active ingredient can be mixed with excipients 
which are pharmaceutically acceptable and compatible with 
the active ingredient in amounts suitable for use in the 
therapeutic methods described herein. Suitable 
excipients are, for example, water, saline, dextrose, 
glycerol, ethanol, or the like, as well as combinations 
of any two or more thereof. In addition, if desired, the 
composition can contain minor amounts of auxiliary 
substances such as wetting or emulsifying agents, pH 
buffering agents, and the like, which enhance the 
effectiveness of the active ingredient. 

The therapeutic composition of the present invention 
can include pharmaceutically acceptable salts of the 
components therein. Pharmaceutically acceptable nontoxic 
salts include the acid addition salts (formed with the 
free amino groups of the polypeptide) that are formed 
with inorganic acids such as, for example, hydrochloric 
acid, hydrobromic acid, perchloric acid, nitric acid, 
thiocyanic acid, sulfuric acid, phosphoric acid, acetic 
acid, propionic acid, glycolic acid, lactic acid, pyruvic 
acid, oxalic acid, malonic acid, succinic acid, maleic 
acid, fumaric acid, anthranilic acid, cinnamic acid, 
naphthalene sulfonic acid, sulfanilic acid, and the like. 
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Salts formed with the free carboxyl groups can also 
be derived from inorganic bases such as, for example, 
sodium hydroxide, ammonium hydroxide, potassium 
hydroxide, and the like; and organic bases such as mono-, 
di-, and tri-alkyl and -aryl amines (e.g., triethylamine, 
diisopropyl amine, methyl amine, dimethyl amine, and the 
like) and optionally substituted ethanolamines (e.g., 
ethanolamine, diethanolamine, and the like) . 

Physiologically tolerable carriers are well known in 
the art. Exemplary liquid carriers are sterile aqueous 
solutions that contain no materials in addition to the 
active ingredients and water, or contain a buffer such as 
sodium phosphate at physiological pH, physiological 
saline or both, such as phosphate-buffered saline. Still 
further, aqueous carriers can contain more than one 
buffer salt, as well as salts such as sodium and 
potassium chlorides, dextrose, polyethylene glycol and 
other solutes. 

Liquid compositions can also contain liquid phases 
in addition to and to the exclusion of water. Exemplary 
additional liquid phases include glycerin, vegetable oils 
such as cottonseed oil, and water-oil emulsions. 

As described herein, an "effective amount" is a 
predetermined amount calculated to achieve the desired 
therapeutic effect, e.g., to modulate the protein 
degradation activity of an invention NAC protein. The 
required dosage will vary with the particular treatment 
and with the duration of desired treatment; however, it 
is anticipated that dosages between about 10 micrograms 
and about 1 milligram per kilogram of body weight per day 
will be used for therapeutic treatment. It may be 
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particularly advantageous to administer such compounds in 
depot or long-lasting form as discussed hereinafter. A 
therapeutically effective amount is typically an amount 
of an NAC-modulating agent or compound identified herein 
5 that, when administered in a physiologically acceptable 
composition, is sufficient to achieve a plasma 
concentration of from about 0.1 yg/ml to about 100 ]ig/ml, 
preferably from about 1.0 pg/ml to about 50 jig/ml, more 
preferably at least about 2 jag/ml and usually 5 to 10 
10 pg/ml . Therapeutic invention anti-NAC antibodies can be 
administered in proportionately appropriate amounts in 
accordance with known practices in this art. 



Also provided herein are methods of treating 
15 pathologies, said method comprising administering an 

effective amount of an invention therapeutic composition. 
Such compositions are typically administered in a 
physiologically compatible composition . 



20 Exemplary diseases related to abnormal cell 

proliferation contemplated herein for treatment according 
to the present invention include cancer pathologies, 
keratinocyte hyperplasia, neoplasia, keloid, benign 
prostatic hypertrophy, inflammatory hyperplasia, 

25 fibrosis, smooth muscle cell proliferation in arteries 

following balloon angioplasty (restenosis), and the like. 
Exemplary cancer pathologies contemplated herein for 
treatment include, gliomas, carcinomas, adenocarcinomas, 
sarcomas, melanomas, hamartomas, leukemias, lymphomas, 

30 and the like. 



Methods of treating pathologies of abnormal cell 
proliferation will include methods of modulating the 
activity of one or more oncogenic proteins, wherein the 
35 oncogenic proteins specifically interact with NAC . 
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Methods of modulating the activity of such oncogenic 
proteins will include contacting the oncogenic protein 
with a substantially pure NAC or an active fragment 
(i.e., oncogenic protein-binding fragment) thereof. This 
contacting will modulate the activity of the oncogenic 
protein, thereby providing a method of treating a 
pathology caused by the oncogenic protein. Further 
methods of modulating the activity of oncogenic proteins 
will include contacting the oncogenic protein with an 
agent, wherein the agent modulates the interactions 
between NAC and the oncogenic protein. 

Also contemplated herein, are therapeutic methods 
using invention pharmaceutical compositions for the 
treatment of pathological disorders in which there is too 
little cell division, such as, for example, bone marrow 
aplasias, immunodeficiencies due to a decreased number of 
lymphocytes, and the like. Methods of treating a variety 
of inflammatory diseases with invention therapeutic 
compositions are also contemplated herein, such as 
treatment of sepsis, fibrosis (e.g., scarring), 
arthritis, graft versus host disease, and the like. 

The present invention also provides methods for 
diagnosing a pathology that is characterized by an 
increased or decreased level of apoptosis in a cell to 
determine whether the increased or decreased level of 
apoptosis is due, for example, to increased or decreased 
expression of a NAC in the cell or to expression of a 
variant NAC. The identification of such a pathology, 
which can be due to altered association of a NAC with a 
NAC-associated protein in a cell, can allow for 
intervention therapy using an effective agent or a 
nucleic acid molecule or an antisense nucleotide sequence 
as described above. In general, a test sample can be 



obtained from a subject having a pathology characterized 
by having or suspected of having increased or decreased 
apoptosis and can be compared to a control sample from a 
normal subject to determine whether a cell in the test 
5 sample has, for example, increased or decreased 

expression of NAC . The level of a NAC in a cell can be 
determined by contacting a sample with a reagent such as 
an anti-NAC antibody or a NAC-associated protein, either 
of which can specifically bind a NAC. For example, the 

10 level of a NAC in a cell can determined by well known 
immunoassay or immunohistochemical methods using an 
anti-NAC antibody (see, for example, Reed et al . , supra, 
1992/ see, also, Harlow and Lane, supra, (1988)). As 
used herein, the term "reagent" means a chemical or 

15 biological molecule that can specifically bind to a NAC 
or to a bound NAC/NAC-associated protein complex. For 
example, either an anti-NAC antibody or a NAC-associated 
protein can be a reagent for a NAC, whereas either an 
anti-NAC antibody or an anti-NAC-associated protein 

20 antibody can be a reagent for a NAC/NAC-associated 
protein complex . 

As used herein, the term "test sample" means a cell 
or tissue specimen that is obtained from a subject and is 

25 to be examined for expression of a NAC in a cell in the 
sample. A test sample can be obtained, for example, 
during surgery or by needle biopsy and can be examined 
using the methods described herein to diagnose a 
pathology characterized by increased or decreased 

30 apoptosis. Increased or decreased expression of a NAC in 
a cell in a test sample can be determined by comparison 
to an expected normal level for a NAC in a particular 
cell type. A normal range of NAC levels in various cell 
types can be determined by sampling a statistically 

35 significant number of normal subjects. In addition, a 
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control sample can be evaluated in parallel with a test 
sample in order to determine whether a pathology 
characterized by increased or decreased apoptosis is due 
to increased or decreased expression of a NAC. The test 
5 sample can be examined using, for example, 

immunohistochemical methods as described above or the 
sample can be further processed and examined. For 
example, an extract of a test sample can be prepared and 
examined to determine whether a NAC that is expressed in 
10 a cell in the sample can associate with a NAC-associated 
protein in the same manner as a NAC from a control cell 
or whether, instead, a variant NAC is expressed in the 
cell . 

15 In accordance with another embodiment of the present 

invention, there are provided diagnostic systems, 
preferably in kit form, comprising at least one invention 
nucleic acid encoding NAC, NAC protein, and/or anti-NAC 
antibody described herein, in a suitable packaging 

20 material. In one embodiment, for example, the diagnostic 
nucleic acids are derived from any of SEQ ID NOs :1, 3 
and 5. Invention diagnostic systems are useful for 
assaying for the presence or absence of nucleic acid 
encoding NAC in either genomic DNA or in transcribed 

25 nucleic acid (such as mRNA or cDNA) encoding NAC. 

A suitable diagnostic system includes at least one 
invention NAC nucleic acid, NAC protein, and/or anti-NAC 
antibody, preferably two or more invention nucleic acids, 

30 proteins and/or antibodies, as a separately packaged 

chemical reagent (s) in an amount sufficient for at least 
one assay. Instructions for use of the packaged reagent 
are also typically included. Those of skill in the art 
can readily incorporate invention nucleic probes and/or 

35 primers into kit form in combination with appropriate 



buffers 
methods 
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and solutions for the practice 
as described herein. 



of the 



invention 



As employed herein, the phrase "packaging material" 
5 refers to one or more physical structures used to house 
the contents of the kit, such as invention nucleic acid 
probes or primers, and the like. The packaging material 
is constructed by well known methods, preferably to 
provide a sterile, contaminant-free environment* The 

10 packaging material has a label which indicates that the 
invention nucleic acids can be used for detecting a 
particular sequence encoding NAC including the nucleotide 
sequences set forth in SEQ ID NOs :1, 3 and 5 or 
mutations or deletions therein, thereby diagnosing the 

15 presence of, or a predisposition for, cancer. In 

addition, the packaging material contains instructions 
indicating how the materials within the kit are employed 
both to detect a particular sequence and diagnose the 
presence of, or a predisposition for, cancer. 

20 

The packaging materials employed herein in relation 
to diagnostic systems are those customarily utilized in 
nucleic acid-based diagnostic systems. As used herein, 
.the term "package" refers to a solid matrix or material 

25 such as glass, plastic, paper, foil, and the like, 
capable of holding within fixed limits an isolated 
nucleic acid, oligonucleotide, or primer of the present 
invention. Thus, for example, a package can be a glass 
vial used to contain milligram quantities of a 

30 contemplated nucleic acid, oligonucleotide or primer, or 
it can be a microtiter plate well to which microgram 
quantities of a contemplated nucleic acid probe have been 
operative ly affixed . 
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"Instructions for use" typically include a tangible 
expression describing the reagent concentration or at 
least one assay method parameter, such as the relative 
5 amounts of reagent and sample to be admixed, maintenance 
time periods for reagent/sample admixtures, temperature, 
buffer conditions, and the like* 

A diagnostic assay should include a simple method 
10 for detecting the amount of a NAC in a sample that is 
bound to the reagent* Detection can be performed by 
labeling the reagent and detecting the presence of the 
label using well known methods (see, for example, Harlow 
and Lane, supra, 1988; chap. 9, for labeling an 
15 antibody) . A reagent can be labeled with various 

detectable moieties including a radiolabel, an enzyme, 
biotin or a f luorochrome . Materials for labeling the 
reagent can be included in the diagnostic kit or can be 
purchased separately from a commercial source. Following 
20 contact of a labeled reagent with a test sample and, if 

desired, a control sample, specifically bound reagent can 
be identified by detecting the particular moiety. 

A labeled antibody that can specifically bind the 
25 reagent also can be used to identify specific binding of 
an unlabeled reagent. For example, if the reagent is an 
anti-NAC antibody, a second antibody can be used to 
detect specific binding of the anti-NAC antibody. A 
second antibody generally will be specific for the 
30 particular class of the first antibody. For example, if 
an anti-NAC antibody is of the IgG class, a second 
antibody will be an anti-IgG antibody. Such second 
antibodies are readily available from commercial sources. 
The second antibody can be labeled using a detectable 
35 moiety as described above. When a sample is labeled 



using a second antibody, the sample is first contacted 
with a first antibody, then the sample is contacted with 
the labeled second antibody, which specifically binds to 
the first antibody and results in a labeled sample. 

5 

In accordance with another embodiment of the 
invention, a method is provided to identify NAC- 
associated proteins. As used herein, the term 
"NAC-associated protein" or "NAP" means a protein that 

10 can specifically bind to NAC or its alternative isoforms. 
Because NAC proteins are known to self-associate, NAC 
proteins are encompassed by the term NAP. An exemplary 
NAP is a protein or a polypeptide portion of a protein 
that can bind the NB-ARC, CARD, LRR, or TIM-Barrel-like 

15 domains of NAC. Similarly, the term "CARD-X Associated 
Protein" or "CAP" refers to a protein that can bind 
specifically to the CARD-X protein. Likewise, since 
CARD-X proteins are known to self -associate, CARD-X 
proteins are encompassed by the term CAP. A NAP or CAP 

20 can be identified, for example, using in vitro protein 
binding assays similar to those described in the 
Examples, by Yeast Two-Hybrid assays similar to those 
described in the Examples, or by other types of 
protein-interaction assays and methods. 

25 

Using NAC or CARD-X, it is clear to one skilled in 
the art of protein purification, protein interaction 
cloning, or protein mass-spectrometry, that NAPs or CAPs 
can be identified using the methods disclosed herein. 

30 

Although the term "NAP" or "CAP" is used generally, 
it should be recognized that a NAP or CAP that is 
identified using an assay described herein can be a 
portion of a protein, which is considered to be a 
35 candidate NAP or CAP. As used herein, the term "active 



70 

fragment" of a NAP or CAP refers to a protein that 
corresponds to a polypeptide sequence that can bind NAC 
or CARD-X, respectively, but that consists of only a 
portion of the full length protein. Although such 
5 polypeptides are considered NAPs or CAPs, it is well 

known that a cDNA sequence obtained from a cDNA library 
may not encode the full length protein. Thus, a cDNA can 
encode a polypeptide that is only a portion of a full 
length protein but, nevertheless, assumes an appropriate 

10 conformation and contains a sufficient region so as to 

bind NAC or CARD-X . However, in the full length protein, 
the polypeptide can assume a conformation that does not 
bind NAC or CARD-X, due for example to steric blocking of 
the NAP or CAP binding site. Such a full length protein 

15 is also an example of a NAP or CAP, wherein NAC-binding 
or CARD-X-binding activity can be activated under the 
appropriate conditions (i.e., phosphorylation, 
proteolysis, protein binding, pH change, and the like) . 
For convenience of discussion, the terms "NAP" and "CAP", 

20 as used herein, are intended to include a NAP or CAP, 
respectively, and active fragments thereof. 

Since CARD-containing proteins are commonly involved 
in apoptosis, the association of a NAP or CAP with NAC or 

25 CARD-X can affect the level of apoptosis in a cell. The 
identification by use of the methods described herein of 
various NAPs or CAPs can provide the necessary insight 
into cell death or signal transduction pathways 
controlled by NAC or CARD-X, allowing for the development 

30 of assays that are useful for identifying agents that 

effectively alter the association of a NAP with NAC or a 
CAP with CARD-X. Such agents can be useful, for example, 
for providing effective therapy for a cancer in a subject 
or for treating an autoimmune disease. These same assays 

35 can be used for identification of agents that modulate 



the self-association of NAC via its CARD domain, NB-ARC 
domain, or other domains within this protein; and, they 
can be used for identification of agents that modulate 
the self-association of CARD-X with itself via its CARD 
5 domain or other domains found within this protein. 

In a normal cell, a steady state level of 
association of NAP and NAC proteins likely occurs* This 
steady state level of association of NAP and NAC proteins 

10 in a particular cell type can determine the normal level 
of apoptosis in that cell type. An increase or decrease 
in the steady state level of association of NAP and NAC 
proteins in a cell can result in an increased or 
decreased level of apoptosis in the cell, which can 

15 result in a pathology in a subject. The normal 

association of NAP and NAC proteins in a cell can be 
altered due, for example, to the expression in the cell 
of a variant NAP or NAC protein, respectively, either of 
which can compete with the normal binding function of NAC 

20 and, therefore, can decrease the association of NAP and 
NAC proteins in a cell. The term "variant" is used 
generally herein to mean a protein that is different from 
the NAP or NAC protein that normally is found in a 
particular cell type. In addition, the normal 

25 association of NAP and NAC proteins in a cell can be 

increased or decreased due, for example, to contact of 
the cell with an agent such as a drug that can 
effectively alter the association of NAP and NAC proteins 
in a cell. 

30 

NB-ARC and CARD domain proteins of the invention, 
NAC(3, NACy and NAC5, were characterized, for example, 
using an in vitro binding assay and CARD-containing 
proteins were further characterized using the yeast two 
35 hybrid system. An in vivo transcription activation assay 
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such as the yeast two hybrid system is particularly 
useful for identifying and manipulating the association 
of proteins. In addition, the results observed in such 
an assay likely mirror the events that naturally occur in 
5 a cell* Thus, the results obtained in such an in vivo 
assay can be predictive of results that can occur in a 
cell in a subject such as a human subject. 

A transcription activation assay such as the yeast 
two hybrid system is based on the modular nature of 
transcription factors, which consist of functionally 
separable DNA-binding and trans-activation domains. When 
expressed as separate proteins, these two domains fail to 
mediate gene transcription. However, transcription 
activation activity can be restored if the DNA-binding 
domain and the trans-activation domain are bridged 
together due, for example, to the association of two 
proteins. The DNA-binding domain and trans-activation 
domain can be bridged, for example, by expressing the 
DNA-binding domain and trans-activation domain as fusion 
proteins (hybrids), provided that the proteins that are 
fused to the domains can associate with each other. The 
non-covalent bridging of the two hybrids brings the 
DNA-binding and trans-activation domains together and 
creates a transcriptionally competent complex. The 
association of the proteins is determined by observing 
transcriptional activation of a reporter gene (see 
Example I) . 

30 The yeast two hybrid systems exemplified herein use 

various strains of S. cerevisiae as host cells for 
vectors that express the hybrid proteins. A 
transcription activation assay also can be performed 
using, for example, mammalian cells. However, the yeast 

35 two hybrid system is particularly useful due to the ease 
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of working with yeast and the speed with which the assay 
can be performed. For example, yeast host cells 
containing a lacZ reporter gene linked to a LexA operator 
sequence were used to demonstrate that the CARD L domain of 
5 NAC (amino acid residues 1128-1473 of SEQ ID NO: 2) can 
interact with several CARD-containing proteins (see 
Examples) . For example, in one case the DNA-binding 
domain consisted of the LexA DNA-binding domain, which 
binds the LexA promoter, fused to the CARD L domain of NAC 

10 and the trans-activation domain consisted of the B42 

acidic region separately fused to several cDNA sequences 
which encoded CARD-containing proteins. When the LexA 
domain was non-covalently bridged to a trans-activation 
domain fused to a CARD-containing protein, the 

15 association activated transcription of the reporter gene. 

A NAP, for example, a CARD-containing protein or an 
NB-ARC-containing protein also can be identified using an 
in vitro assay such as an assay utilizing, for example, a 

20 glutathione-S-transf erase (GST) fusion protein as 
described in the Examples. Such an in vitro assay 
provides a simple, rapid and inexpensive method for 
identifying and isolating a NAP. Such an in vitro assay 
is particularly useful in confirming results obtained in 

25 vivo and can be used to characterize specific binding 

domains of a NAP. For example, a GST/CARD L fusion protein 
can be expressed and can be purified by binding to an 
affinity matrix containing immobilized glutathione. If 
desired, a sample that can contains a CARD-containing 

30 protein or active fragments of a CARD-containing protein 
can be passed over an affinity column containing bound 
GST/CARD L and a CARD-containing protein that binds to 
CARD L can be obtained. In addition, GST/ CARD L can be used 
to screen a cDNA expression library, wherein binding of 
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the GST/ CARD L fusion protein to a clone indicates that the 
clone contains a cDNA encoding a CARD-containing protein. 

In another embodiment of the invention, methods are 
5 provided for monitoring the progress of treatment for a 
pathology that is characterized by an increased or 
decreased level of apoptosis in a cell, which methods are 
useful to ascertain the feasability of such treatment. 
Monitoring such a therapy, such as, e.g., a therapy that 

10 alters association of a NAC with a NAC-associated protein 
in a cell using an effective agent, can allow for 
modifications in the therapy to be made, including 
decreasing the amount of effective agent used in therapy, 
increasing the amount of effective agent, or using a 

15 different effective agent. In general, a test sample can 
be obtained from a subject having a pathology 
characterized by increased or decreased apoptosis, which 
sample can be compared to a control sample from a normal 
subject to determine whether a cell in the test sample 

20 has, for example, increased or decreased expression of 
NAC. Preferably, this control sample is a previous 
sample from the same patient, thereby providing a direct 
comparison of changes to the pathology as a result of the 
therapy. The level of a NAC in a cell can be determined 

25 by contacting a sample with a reagent such as an anti-NAC 
antibody or a NAC-associated protein, either of which can 
specifically bind a NAC . For example, the level of a NAC 
in a cell can determined by well known immunoassay or 
immunohistochemical methods using an anti-NAC antibody 

30 (see, for example, Reed et al . , supra, 1992; see, also, 
Harlow and Lane, supra, (1988) ) . 

In accordance with another embodiment of the 
invention, there are provided methods for determining a 
35 prognosis of disease free or overall survival in a 
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patient suffering from cancer* For example, it is 
contemplated herein that abnormal levels of NAC proteins 
(either higher or lower) in primary tumor tissue show a 
high correlation with either increased or decreased tumor 
5 recurrence or spread, and therefore indicates the 

likelihood of disease free or overall survival. Thus, 
the present invention advantageously provides a 
significant advancement in cancer management because 
early identification of patients at risk for tumor 

10 recurrence or spread will permit aggressive early 

treatment with significantly enhanced potential for 
survival. Also provided are methods for predicting the 
risk of tumor recurrence or spread in an individual 
having a cancer tumor; methods for screening a cancer 

15 patient to determine the risk of tumor metastasis; and 
methods for determining the proper course of treatment 
for a patient suffering from cancer. These methods are 
carried out by collecting a sample from a patient and 
comparing the level of NAC expression in the patient to 

20 the level of expression in a control or to a reference 

level of NAC expression as defined by patient population 
sampling, tissue culture analysis, or any other method 
known for determining reference levels for determination 
of disease prognosis. The level of NAC expression in the 

25 patient is then classified as higher than the reference 
level or lower than the reference level, wherein the 
prognosis of survival or tumor recurrence is different 
for patients with higher levels than the prognosis for 
patients with lower levels. 

30 

All U.S. patents and all publications mentioned 
herein are incorporated in their entirety by reference 
thereto. The invention will now be described in greater 
detail by reference to the following non-limiting 
35 examples . 
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EXAMPLES 

1.0 cDNA Cloning. Jurkat total RNA was 
reverse-transcribed to complementary DNAs using MMLV 
5 reverse transcriptase (Stratagene) and random 

hexanucleotide primers. Three overlapping cDNA fragments 
of NAC were amplified from the Jurkat complementary DNAs 
with Turbo Pfu DNA polymerase (Stratagene) using the 
following oligonucleotide primer sets: primer set 1; 
10 5 T -CCGAATTCACCATGGCTGGCGGAGCCTGGGGC-3 T (forward; SEQ ID 
NO: 13) and 5 T -CCGCTCGAGTCAACAGAGGGTTGTGGTGGTCTTG-3 T 
(reverse; SEQ ID NO: 14), primer set 2; 

5 ' -CCCGAATTCGAACCTCGCATAGTCATACTGC-3 T (forward; SEQ ID 
NO: 15) and 5 T -GTCCCACAACAGAATTCAATCTCAACGGTC-3 ' (reverse; 

15 SEQ ID NO:16), and primer set 3; 

5 T -TGTGATGAGAGAAGCGGTGAC-3 1 (forward; SEQ ID NO: 17) and 
5 T -CCGCTCGAGCAAAGAAGGGTCAGCCAAAGC-3 1 (reverse; SEQ ID 
NO:18). The resultant cDNA fragments were ligated into 
mammalian expression vector pcDNA-myc (Invitrogen, 

20 modified as described in Roy et al . , EMBO J. 16:6914-6925 
(1997) ) and assembled to full-length cDNA by ligating 
fragments 2 and 3 at the EcoRI site to make fragment 4, 
and by ligating fragments 1 and 4 at the Bst XI site, as 
depicted in Figure 1A. Sequencing analysis of the 

25 assembled full-length cDNA was carried out, and splice 
isoforms (shown as dotted and hatched regions in Figure 
IB) of NAC clones were identified. The full-length NAC 
nucleotide and protein sequences, including two 
alternatively spliced regions underlined (nucleotides 

30 2870-2959 and 3784-3915 of SEQ ID NO:l, respectively), 
are presented in Figure 1C. The full length nucleotide 
sequence of three of the isoforms is set forth in SEQ ID 

NOs:l, 3 and 5, corresponding to NACp, NACy and NAC5, 
respectively. 
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Comparison of NAC to known protein sequences using 
Clustal multiple sequence alignment (Thompson et al . , 
Nucleic Acids Research 22:4673-4680 (1994)) revealed that 
the CARD domain of NAC (see, e.g., residues 1373 to 1473 
5 of SEQ ID NO: 2) is similar to numerous CARD domain 

proteins. Further sequence analysis predicted an a s p 8 
(TIM) -Barrel-like domain similar to those observed in 
aldolase and RuBisCo in NAC, located on the immediate 
amino terminal side of the predicted CARD domain (see, 

10 e.g., residues 1079 to 1364 of SEQ ID NO:2). 

Additionally, a portion of NAC was found to have sequence 
portions homologous to NB-ARC domains (see, e.g., 
residues 329 to 547 of SEQ ID NO: 2) and a leucine-rich 
repeat region (see, e.g., residues 808 to 947 of SEQ ID 

15 NO:2). Based on its homology to the above proteins the 
protein of the invention has been termed a NAC protein, 
as it is a NB-ARC and CARD domain containing protein. 
ClustalW multiple sequence alignment with other NB-ARC 
and CARD domain containing proteins confirmed the 

20 homology of NAC to other proteins in both the NB-ARC 
region (particularly in the P-loop, or Walker A, and 
Walker B portions) and CARD region (Figure ID and Figure 
IE, respectively) . This sequence analysis represents the 
first time a domain resembling a TIM-barrel domain has 

25 been identified in a protein that also contains a CARD 
domain, and also the first time a domain resembling a 
TIM-barrel domain has been identified in a protein that 
also contains an NB-ARC domain. 



30 2.0 Plasmid Constructions. Complementary DNA encoding 
the CARD domain of NAC was amplified from Jurkat cDNAs 
with Turbo Pfu DNA polymerase (Stratagene) and primer set 
3 as described above. The resultant PCR fragments were 
digested with EcoRI and Xho I restriction enzymes and 

35 ligated into pGEX-4Tl (Pharmacia) and pcDNA-myc vectors. 



78 

This region of NAC contains two alternatively spliced 
isoforms, termed CARD L (amino acid residues 1128-1473 of 
SEQ ID NO: 2) and CARD S (amino acid residues 1128-1261 and 
1306-1473 of SEQ ID NO : 2 ) . The region of cDNA encoding 
5 NB-ARC domain was PCR- amplified using primers SEQ ID 
N0:15 (forward) and SEQ ID NO:14 (reverse). The 
resultant PCR fragment was digested with EcoRI and Xho I 
restriction enzymes (yielding a fragment encoding amino 
acid residues 326-551 of SEQ ID N0:2) and ligated into a 
10 pGEX-4Tl and pcDNA-myc vectors. 

3.0 In vitro Protein Binding Assays. NB-ARC, CARD L , and 
CARD S in pGEX-4Tl were expressed in XL-1 blue E. coli 
cells (Stratagene) , and affinity-purified using 

15 glutathione (GSH) -sepharose according to known methods, 
such as those in Current Protocols in Molecular Biology , 
Ausubel et al. eds . , John Wiley and Sons (1999). For GST 
pull-down assays, purified CARD L and CARD S GST fusion 
proteins and GST alone (0.1-0.5 ]xg immobilized on 10-15 

20 |jl GSH-sepharose beads) were incubated with 1 mg/ml of 

BSA in 100 pi Co-IP buffer [142.4 mM KC1, 5mM M g Cl 2/ 10 mM 
HEPES (pH 7.4), 0.5 mM EGTA, 0.2% NP-40, 1 mM DTT, and 1 
mM PMSF] for 30 min. at room temperature. The beads were 
then incubated with 1 ]il of rat reticulocyte lysates 

25 (TnT-lysate; Promega, Inc.) containing 35 S-labeled, in 

vitro translated CARD L , CARD S , or control protein Skp-1 in 
100 p.1 Co-IP buffer supplemented with 0.5 mg/ml BSA for 
overnight at 4°C. The beads were washed four times in 
500 i_il Co-IP buffer, followed by boiling in 20 \xl 

30 Laemmli-SDS sample buffer. The eluted proteins were 

analyzed by SDS-PAGE. The bands of SDS-PAGE gels were 
detected by f luorography . 

The resultant homodimerization pattern reveals that 
35 CARD L -CARD L , CARD S -CARD S , and both CARD L - CARD S containing 



lanes have very strong signals, whereas lanes containing 
control GST alone and control Skp-1 have negligible 
signals (Figure 2A) . Thus, CARD domains of the invention 
NAC show a very strong ability to self-associate in 
5 vitro. 

In vitro translated Apaf-1 (lacking its WD domain), 
CED4, and control Skp-1 proteins were subjected to GST 
pull-down assay using GSH-sepharose beads conjugated with 

10 GST, GST-CARD L , and GST-CARD S as described above. Both 

lanes containing GST-CARD S and lanes containing GST-CARD L 
yielded very strong signals when incubated with either 
Apaf-1 (-WD) or CED4, whereas, the controls GST alone and 
Skp-1 again yielded negligible signals (Figure 2B) . 

15 Thus, in addition to self-association, CARD domains of 
the invention NAC demonstrate the ability to in vitro 
associate with other CARD-containing proteins. 

4.0 Protein Interaction Studies in Yeast. EGY48 yeast 

20 cells (Saccharomyces cerevisiae: MATcx, trpl, ura3, his, 
leu2 : :plexApo6-leu2) were transformed with pGilda-CARDL 
plasmids (his marker) encoding the LexA DNA binding 
domain fused to: CARD domains of NAC (CARDJ and 
caspase-9; pro-caspase-8 ; Apaf-1 without its WD domain; 

25 Bcl-XL, Bax and Bcl-2 without transmembrane domains. 
EGY48 were also transformed with vector pJG4-5 (trpl 
marker) encoding the above listed group of proteins and 
additionally vRas and FADD as target proteins, fused to 
B42 transactivation domain, and the cells were 

30 transformed with a LexA-LacZ reporter plasmid pSH1840 
(ura3 marker,), as previously described (Durfee et al . , 
1993; Sato et al., 1995). Sources for cells and plasmids 
were described previously in U.S. Patent 5,632,994, and 
in Zervous et al . , Cell 72:223-232 (1993); Gyuris et al . , 

35 Cell 75:791-803 (1993); Golemis et al . , In Current 
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Protocols in Molecular Biology (ed. Ausubel et al . ; Green 
Publ . ; NY 1994) f each of which is incorporated herein by 
reference, Transf ormants were replica-plated on 
Burkholder T s minimal medium (BMM) plates supplemented 
5 with leucine and 2% glucose as previously described (Sato 
et al., Gene 140:291-292 (1994)). Protein-protein 
interactions were scored by growth of transf ormants on 
leucine deficient BMM plates containing 2% galactose and 
1% raffinose. 

10 

Protein-protein interactions were also evaluated 
using |3-galactosidase activity assays. Colonies grown on 
BMM/Leu/Glucose plates were filter-lifted onto 
nitrocellulose membranes, and incubated over-night on 

15 BMM/Leu/galactose plates. Yeast cells were lysed by 

soaking filters in liquid nitrogen and thawing at room 
temperature. (J-galactosidase activity was measured by 
incubating the filter in 3.2 ml Z buffer (60 mM, Na 2 HP0 4 , 
40 mM Na 2 HP0 4 , 10 mM KC1, 1 mM MgS0 4 ) supplemented with 50 

20 ]il X-gal solution (20mg/ml) . Levels of |3-galactosidase 
activity were scaled according to the intensity of blue 
color generated for each transf ormant . 

The results of this experiment showed colonies on 
25 leucine deficient plates for yeast containing 

NAC - CARD L / L e xA fusions together with caspase-9/B42, 
Apaf-1/B42, and Bax/B42 fusions (Figure 3) . In addition, 
the NAC-CARD L /LexA: caspase-9/B42 and 

NAC-CARD L /LexA : Apaf- 1/B42 cells had significant amounts of 
30 LacZ activity. The cells containing the complementary 
fusions caspase-9/LexA:NAC-CARD L /B4 2 and 

Apaf-1 /LexA: NAC-CARD L /B42 also grew on leucine deficient 
plates and showed significant LacZ activity. Thus all 
four indicators of protein : protein interaction confirmed 
35 that the CARD L domain of NAC interacts with the CARD 
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domains of caspase-9 and with Apaf-1. Partial indication 
of the protein : protein interactions with NAC-CARD L were 
observed for Bax, caspase-8, Bcl-XL and Bcl-2, suggesting 
that a broad range of CARD domain proteins also interact 
5 with the CARD domain of NAC . 

Similar two-hybrid interaction experiments have been 
performed using the CARD domain of the CARD-X protein. 
Table I summarizes the results of the two-hybrid 

10 experiments wherein a fusion protein containing the DNA- 
binding domain of the LexA protein expressed from the 
pGilda plasmid and a CARD domain from CARD-X or several 
other CARD-containing proteins, including CARDIAK, NAC 
(CARD L ) , Apaf-1, caspases-2, 9, and 11, were expressed in 

15 the sames cells as CARD domains from CARD-X, CARDIAK, 
NAC ( CARD L ) , caspase-9 and clAP-2, expressed as fusion 
proteins with a transactivation domain from the B42 
protein from the pJG4-5 plasmid, as described above. As 
shown, the CARD domain of CARD-X interacted with itself 

20 but not with the CARD domains of other proteins. 
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TABLE I 

Yeast Two Hybrid Analysis of CARD-X : CARD interactions 



pGilda pJG4-5 Results 



1 


CARD-X CARD 


CARD-X-CARD 


+ + + 


2 


CARD-X CARD 


CARDIAK 




3 


CARD-X CARD 


NAC-CARD L 




4 


CARD-X CARD 


Caspase-9 CARD 




5 


CARD-X CARD 


cIAP-2 




6 


CARDIAK 


CARD-X CARD 




7 


NAC-CARD L 


CARD-X CARD 




8 


APAF C3+C4 


CARD-X CARD 




9 


Caspase-2 


CARD-X CARD 




10 


Caspase-11 


CARD-X CARD 




11 


Caspase 9-C- terminus 


CARD-X CARD 




12 


CARDIAK 


CARDIAK 


+ + + + 



5.0 Self -Association of NB-ARC domain of NAC. In vitro 
translated, 35 S-labeled rat reticulocyte lysates (1 pi) 
containing NB-ARC or Skp-1 (used as a control) were 
incubated with GSH-sepharose beads conjugated with 
purified GST-NB-ARC or GST alone for GST pull-down assay, 
resolved on SDS-PAGE and visualized by fluorography as 
described above. One tenth of input were loaded for 
NB-ARC or Skp-1 as controls. In this assay, the 
NB-ARC-containing fragment of NAC demonstrates a strong 
ability to homodimerize (Figure 4). 

The ability to self-associate and to bind other 
known CARD domains establishes the CARD domains of NAC, 
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CARD S and CARD L , as capable of the same protein-protein 
interactions observed in other known CARD domains. The 
ability of CARD-X to self-associate also establishes this 
protein as having the same protein-protein interaction 
properties of known CARD proteins. Thus two isoforms of 
a new human CARD domain have been characterized, and a 
highly related sequence of another human protein CARD-X 
has also been characterized. In addition, the ability of 
the putative NB-ARC domain of NAC has been shown to both 
self-associate, establishing this domain as capable of 
the same protein-protein interactions observed in other 
known NB-ARC domains. Therefore, the NAC protein has 
been demonstrated to contain both a functional CARD 
domain and a functional NB-ARC domain. 

6.0 Protein-Protein Interactions of NAC. Transient 
transfection of 293T, a human embryonic kidney fibroblast 
cell line, were conducted using SuperFect reagents 
(Qiagen) according to manufacturer's instructions. The 
cDNA fragments encoding full-length CED4 and the 
truncated form of Apaf-1 (Apaf-IAWD) comprising amino 
acids 1-420 of the human Apaf-1 protein were amplified by 
PCR and subcloned into pcDNA3HA at EcoRI and Xho I sites. 
Expression plasmids encoding catalytically inactive forms 
of pro-Casp8 [pro-Casp8 (C/A) ] was prepared by replacing 
Cys 377 with an Ala using site-directed mutagenesis and 
pro-Casp9 [pro-Casp9 (C/A) ] has been described 
previously, Cardone et al., Science 282:1318-1321 
(1998)). 293T cells were transiently transfected with an 
expression plasmid (2 pg) encoding HA-tagged human Apaf- 
1AWD, CED4, pro-Casp8 (C/A) or C-Terminal Flag-tagged 
pro-Casp9 (C/A) in the presence or absence of a plasmid 
(2 pg) encoding myc-tagged NAC (encoding amino acid 
residues 1-1261 and 1306-1473 of SEQ ID NO:2). After 24 
hr growth in culture, transfected cells were collected 
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and lysed in Co-IP buffer [142.4 mM KC1, 5 mM MgCl 2 , 10 mM 
HEPES (pH 7.4), 0.5 mM EGTA, 0.1 % NP-40, and 1 iriM DTT] 
supplemented with 12.5 mM p-glycerolphosphate, 2 mM NaF, 
1 mM Na 3 V0 4 , 1 mM PMSF, and IX protenase inhibitor mix 
(Boehringer Mannheim) . Cell lysates were clarified by 
microcentrifugation and subjected to immunoprecipitation 
using either a mouse monoclonal antibody to myc (Santa 
Cruz Biotechnologies, Inc) or a control mouse IgG. 
Proteins from the immune complexes were resolved by SDS- 
PAGE, transferred to nitrocellulose membranes, and 
subjected to immunoblot analysis using anti-HA antibodies 
followed by anti-myc antibodies using a standard Western 
blotting procedure and ECL reagents from Amersham- 
Pharmacia Biotechnologies, Inc. (Krajewski et al., Proc. 
Natl. Acad. Sci. USA 96:5752-5757 (1999)). 

The results show that NAC of the invention interacts 
with other NB-ARC and CARD-containing proteins, Apaf-1 
(Figure 5A) and CED-4 (Figure 5B) , and additionally with 
caspase-8 (Figure 6A) , but not with caspase-9 (Figure 
6B) . This is in contrast with the observed interaction 
between caspase-9 and the CARD L domain of NAC from the 
above described yeast two-hybrid assay. This may be due 
to the regulation of the full-length NAC in terms of its 
ability to interact with pro-caspase-9 such that NAC is 
in either a latent (off) or active (on) conformation, 
analogous to Apaf-1 which binds pro-caspase-9 only when 
cytochrome c is produced to induce a conformational 
change in Apaf-1. As with NAC, if only the CARD domain 
of Apaf-1 is expressed, it will bind to pro-caspase-9 
independently of the coactivator, cytochrome c (Qin et 
al., Nature 399:549-557 (1999)). 



Although the invention has been described with 
reference to the examples above, it should be understood 
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that various modifications can be made without departing 
from the spirit of the invention. Accordingly, the 
invention is limited only by the following claims. 
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That which is claimed is: 

1* Isolated nucleic acid encoding a NB-ARC and 
CARD containing protein (NAC) , or functional fragments 
thereof, selected from: 

(a) DNA encoding the amino acid sequence set 
forth in SEQ ID N0s:2, 4 or 6, or 

(b) DNA that hybridizes to the DNA of (a) 
under moderately stringent conditions, wherein said 
DNA encodes biologically active NAC, or 

(c) DNA degenerate with respect to either (a) 
or (b) above, wherein said DNA encodes biologically 
active NAC. 

2. A nucleic acid according to claim 1, wherein 
said nucleic acid hybridizes under high stringency 
conditions to the NAC coding portion of any of SEQ ID 
NOs : 1, 3 and 5 . 

3. A nucleic acid according to claim 1, wherein 
the nucleotide sequence of said nucleic acid is 
substantially the same as set forth in any of SEQ ID 
NO : 1 , 3 and 5 . 

4. A nucleic acid according to claim 1, wherein 
the nucleotide sequence of said nucleic acid is the same 
as that set forth in any of SEQ ID NOs:l, 3 and 5. 

5. A nucleic acid according to claim 1, wherein 
said nucleic acid is cDNA. 



6. A vector containing the nucleic acid of claim 
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7. Recombinant cells containing the nucleic acid 
of claim 1 . 

8. An oligonucleotide comprising at least 15 
nucleotides capable of specifically hybridizing with a 
the nucleotide sequence set forth in any of SEQ ID NOs:l, 
3 and 5 . 

9. An oligonucleotide according to claim 8, 
wherein said oligonucleotide is labeled with a detectable 
marker . 

10. An antisense-nucleic acid capable of 
specifically binding to mRNA encoded by said nucleic acid 
according to claim 1. 

11. A kit for detecting the presence of the NAC 
cDNA sequence comprising at least one oligonucleotide 
according to claim 9 . 

12. An isolated NAC protein comprising a NB-ARC 
domain, a CARD domain and a TIM-Barrel-like domain. 

13. The protein of claim 12, further comprising a 
LRR domain. 

14. An isolated protein according to claim 12, 
wherein the amino acid sequence of said protein comprises 
substantially the same sequence as any of SEQ ID NOs:2, 4 
or 6 . 

15. A NAC according to claim 14 comprising the same 
amino acid sequence as set forth in any of SEQ ID N0s:2, 

4 or 6. 



16. A NAC according to claim 14, wherein said 
protein is encoded by a nucleotide sequence comprising 
substantially the same nucleotide sequence as set forth 
in SEQ ID 3ST0s:l, 3 or 5. 

5 

17. A NAC according to claim 14, wherein said 
protein is encoded by a nucleotide sequence comprising 
the same sequence as set forth in SEQ ID NOs:l, 3 or 5 . 

10 18. A method for expression of a NAC protein, said 

method comprising culturing cells of claim 7 under 
conditions suitable for expression of said NAC. 

19. An isolated anti-NAC antibody having specific 
15 reactivity with a NAC according to claim 12. 

20. Antibody according to claim 19, wherein said 
antibody is a monoclonal antibody. 

20 21. A cell line producing the monoclonal antibody 

of claim 20. 



22. An antibody according to claim 19, wherein said 
antibody is a polyclonal antibody. 

25 

23. A composition comprising an amount of the 
antisense-nucleic acid according to claim 10 effective to 
inhibit expression of a human NAC and an acceptable 
hydrophobic carrier capable of passing through a cell 

3 0 membrane. 



24. A transgenic nonhuman mammal expressing 
exogenous nucleic acid according to claim 1, encoding a 
NAC. 



35 
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25. A transgenic nonhuman mammal according to claim 
24, wherein said nucleic acid encoding said NAC has been 
mutated, and wherein the NAC so expressed is not native 
NAC. 

26. A transgenic nonhuman mammal according to claim 
24, wherein the transgenic nonhuman mammal is a mouse. 

27. A method for identifying nucleic acids encoding 
a mammalian NAC, said method comprising: 

contacting a sample containing nucleic acids with an 
oligonucleotide according to claim 8, wherein said 
contacting is effected under high stringency 
hybridization conditions, and identifying compounds which 
hybridize thereto. 

28. A method for detecting the presence of a human 
NAC in a sample, said method comprising contacting a test 
sample with an antibody according to claim 19, detecting 
the presence of an antibody: NAC complex, and therefor 
detecting the presence of a human NAC in said test 
sample . 

29. Single strand DNA primers for amplification of 
NAC nucleic acid, wherein said primers comprise a nucleic 
acid sequence derived from the nucleic acid sequences set 
forth as SEQ ID NOs:l, 3 and 5. 

30. A method for modulating the activity of an 
oncogenic protein, comprising contacting said oncogenic 
proteins with a substantially pure NAC, or an oncogenic 
protein-binding fragment thereof. 
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31. A method of identifying an effective agent 
that alters the association of a NAC with a NAC 
associated protein (NAP) , comprising the steps of: 

a) contacting said NAC and NAP proteins, 
under conditions that allow said NAC and NAP 
proteins to associate with an agent suspected of 
being able to alter the association of said NAC and 
NAP proteins; and 

b) detecting the altered association of said 
NAC and NAP proteins, wherein said altered 
association identifies an effective agent. 

32. The method of claim 31, wherein said altered 
association is detected by measuring the transcriptional 
activity of a reporter gene. 

33. The method of claim 31, wherein said NAC has 
nucleotide binding activity. 

34. The method of claim 31, wherein said effective 
agent is a drug. 

35. The method of claim 31, wherein said effective 
agent is a protein. 

36. A method for modulating an activity mediated by 
a NAC protein, said method comprising: 

contacting said NAC protein with an effective, 
modulating amount of an agent identified by claim 31. 



37. The method of claim 36, wherein said modulated 
activity is selected from the group consisting of: 
binding of NAC to a CARD-containing protein; binding of 
NAC to a NB-ARC-containing protein; binding of NAC to a 
LRR-containing protein; and caspase proteolytic activity. 

38. A method of modulating the level apoptosis in a 
cell, comprising the steps of: 

a) introducing a nucleic acid molecule encoding a 
NAC into the cell; and 

b) expressing said NAC in said cell, wherein the 
expression of said NAC modulates apoptosis in said 
cell. 

39. A method of modulating the level of apoptosis 
in a cell, comprising introducing an antisense nucleotide 
sequence into the cell, wherein said antisense nucleotide 
sequence specifically hybridizes to a nucleic acid 
molecule encoding a NAC, wherein said hybridization 
reduces or inhibits the expression of said NAC in said 
cell. 

40. A therapeutic composition comprising a compound 
selected from a NAC, or functional fragment thereof, a 
NAC modulating agent identified according to claim 31, or 
an anti-NAC antibody; and a pharmaceutically acceptable 
carrier . 

41. A method of treating a pathology characterized 
by abnormal cell proliferation or abnormal inflammation, 
said method comprising administering an effective amount 
of the composition according to claim 40. 
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42. A method of diagnosing a pathology 
characterized by an increased or decreased level of a NAC 
in a subject, comprising the steps of: 

a) obtaining a test sample from the subject; 

b) contacting said test sample with an agent that 
can bind said NAC under suitable conditions, which 
allow specific binding of said agent to said NAC; 
and 

c) comparing the amount of said specific binding 
in said test sample with the amount of specific 
binding in a control sample, wherein an increased or 
decreased amount of said specific binding in said 
test sample as compared to said control sample is 
diagnostic of a pathology. 

43. The method of claim 42, wherein said agent is 
an anti-NAC antibody or a NAC-associated-protein (NAP) . 

44. A method of modulating the level of apoptosis 
in a cell, comprising contacting the cell with an agent 
that effectively alters the association of NAC with a 
NAC-associated-protein in the cell, or that effectively 
alters the activity of a NAC in the cell. 

45. A chimeric protein comprising a domain selected 
from the group consisting of the NB-ARC domain of the NAC 
of claim 14 and the CARD of the NAC of claim 14. 

46. An isolated protein comprising a TIM-Barrel- 
like domain and a second domain selected from the group 
consisting of a CARD domain, a NB-ARC domain, and a LRR 
domain . 
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47. The chimeric protein of claim 45, comprising 
the NB-ARC domain of SEQ ID NO : 2 and the CARD domain of 
SEQ ID NO: 8. 

48. The method of claim 31, wherein said agent 
modulates CARD: CARD association or NB-ARC : NB-ARC 
association. 

49. A method of modulating CARD: CARD interactions 
comprising contacting a NAC protein with the agent of 
claim 48. 

50. The method of claim 31, wherein said agent 
modulates transcription . 

51. The method of claim 50, wherein said agent 
modulates NF-kB activity. 

52. A method of modulating transcription comprising 
contacting a cell with a compound selected from the group 
consisting of: a NAC protein or functional fragment 
thereof, an agent identified according to claim 31, and 
an anti-NAC antibody. 

53. A method of diagnosing cancer or monitoring 
cancer therapy comprising contacting a test sample from a 
patient with the antibody of claim 19. 

54. A method of assessing prognosis of patients 
with cancer comprising contacting a test sample from a 
patient with the antibody of claim 19. 

55. An effective agent that binds a nucleotide 
binding site of NAC. 
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56. An effective agent that modulates the 
association of NAC or CARD-X with a pro-caspase or a 
caspase . 

57. The method of claim 56, wherein said pro- 
caspase is pro-caspase-8 and said caspase is caspase-8. 

58. The method of claim 56, wherein said pro- 
caspase is pro-caspase-9 and said caspase is caspase-9. 

59. The method of claim 56, wherein said effective 
agent inhibits the association of said NAC with said pro- 
caspase or said caspase. 

60. The method of claim 56, wherein said effective 
agent increases the association of said NAC with said 
pro-caspase or said caspase. 

61. An effective agent that modulates the 
association of NAC or CARD-X with a CED-4 family protein. 

62. The method of claim 61, wherein said CED-4 
family protein is selected from the group consisting of 
CED-4, Apaf-1, Dark, and CARD4/nodl. 

63. The method of claim 61, wherein said CED-4 
family protein is Apaf-1. 

64. The method of claim 61, wherein said effective 
agent inhibits the association of said NAC with said CED- 
4 family protein. 
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65. The method of claim 61, wherein said effective 
agent increases the association of said NAC with said 
CED-4 family protein. 
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ABSTRACT 

The present invention provides NB-ARC and CARD-containing 
5 proteins (NACs) , nucleic acid molecules encoding NACs and 
antibodies specific for at least one NAC . The invention 
further provides chimeric NAC proteins. The invention 
also provides screening assays for identifying an agent 
that can effectively alter the association of a NAC with 

10 a NAC-associated protein. The invention further provides 
methods of modulating apoptosis in a cell by introducing 
into the cell a nucleic acid molecule encoding a NAC or 
an antisense nucleotide sequence* The invention also 
provides a method of using a reagent that can 

15 specifically bind to a NAC to diagnose a pathology that 
is characterized by an increased or decreased level of 
apoptosis in a cell. 
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KG OCT GX GGA OX 103 GX OX CTG OX TGT TTC TIG GG TIC OGAPGAPGGGGG OG APG GPG TIC OG OT OG OC GX A*£T APA GOG OC TX AGS AX TCT TGG GGT GG 120 

MAGGAWGRLACYLEFLKKEE LKEFQLLLANKAHSRSSSGE 40 

ACA GX GCT CPG CCA GPG APG AX AGT QX ATG GPG GIG GX ICG IPC CIG GIG GCT OG W OE GC Cffi OS OT TO (K Cm QX OC Off ^ TO3 (K CPG ATC 240 

TPAQPEKTSGMEVASYLVAQ YGEQRAWDLALHTWEQMGLR 80 

7XA OG TX GX CPA GX CAG GPA GOG CCA GX OC TCT CX TCA TIC CCC TAC AX CCA AGT GAA GX OC CIG GOG TOT GX AX CPA GX AX TX AX GCATjIG CIA ATG CCC TX 360 

SLCAQAQEGAGHS PSFPYS P SEPHLGSPSQPTSTAVLMPW 120 

ATC CAT GAA TIG CCG GOG GX TX AX OG GX TCA GPG AGA AX GIT TIG ACA OG CIG CCT OC ACA TCT GGA GX GX TOG AGA GPA ATC TCT GX TCACTC CTC TPC CPA OCT CIT 480 

IHELPAGCTQGSERRVLRQL PDTS GRRWREISASLLYQAL 160 

CCA AX TCC GCA GPC Off GPG TCT CCA AX CTG GPG TCA OX AAC GX QX ACA ICC ACA GCA GIG CTG GQ3 AX TGG GGA TCC CCA OCT CPG OX AX CIA CCA CCC AGA GPG OG GPG 600 

PSSPDHESPSQESPNAPTST AVLGSWGSPPQPSLAPREQE 200 

GCT GCT CGG AX CPA TGG CCT CIG GAT GPA AX TCA CCA ATT TAC TPC ACA CPA ATC AGA GAA AGA GPG AGA GPG APA TCA GPG Pm GX AX GX CCA TX GCA GX GTG GTA GGA AX 720 

*APGTQWPLDETSGIYYTEIR EREREKSEKGR PPWAAVVGT 240 

GX CCA CAG GCG QC AX AX CIA OG OX OC OC OC CCA TGG GPG OCT TCT GIG AGA GPG AX CTC TGT TCC ACA TGG OX TGG AAA AAT GPG CAT TIT APC CPA APA TTC PCPi OG 840 

PPQAHTS LQPHHHPWEPSVR ES LCSTWPWKNEDFNQKFTQ* 290 

CTG CIA CTT CIA CPA AGA GCT OC GX AGA AX CPA GAT GX CTG GIC AAG AGA AX TG3 OCT GP£T TAT GIG GPG GPG APT OGA GGA CAT TTA ATT GPG ATC AGA GPC TEA TIT GX CCA 960 

LLLLQRPH PRSQDPLVKRSW PDYVEENRGHLIEIRDLFGP 320 

GX CTG GAT AX CPA GPA CCT GX ATA GTC ATA CTG CPG GGG GCT GCT GGA ATT 033 AAG TCA ACA CTG GX AX CPG GIG APG GPA GX TGG GGG AGA GX OG CIG TAT GGG GPC GX 1080 

GLDTQEPR IVI LOGAAGI GK STL ARQVKEAWGRGQLYGDR 360 

P-loop (Walker A) 

TIC OG OCT GIC TTC TAC TTC AX TX AGA GPG CTG GX OG TX APG GIG GIG AGT CTC GCT GPG CTC ATC GGA APA GAT GX ACA GX ACT GX GCT GX ATT AGA OG ATC CIG TCT 1200 

FQHVFY FSCRELAQSKVVSL. AELTGKDGTATPAPIRQI LS 400 

/GG CCA GG CGG CIG CIC TTC ATC CIC GAT GGT GCA GAT GPG GCA GGA TGG GIC TIG OG QG GX A3T TCT GPG CIC TGT CTG OC TGG AX OG CCA OG GGG GGG GAT GCA CTG CIG 1320 

R P E R L L F I L D G V DEPGWVLQ EPSSELCLHWSQPQPADALL 440 

Walker B 

GX AGT TIG CIG GX AAA ACT ATA CTT GX GPG GCA TX TTC CTG ATC AX GCT CGG AX ACA GCT CTG OG APC CTC ATT CCT TCT TTG GPG OG GCA CGT TGG GTA GPG GTC CIG GX 1440 

GSLLGKTI LPEASFLITART T A L Q* N L I PSLEQ.ARWVEVL.G 480 

TIC TCT OG TX AX AX AAG GAA TAT TIC TAC AGA TAT TTC ACAGAT GPA AX CAA GCA ATT ACA GX TIT AX TTG GTC APA TCA APC APA GG CTC TX GX CTG TGT CTT GTG OX 1560 

FSESSRKEYFYRYFTDERQA IRAFRLVKSNKELWALCLVP 520 

TGG GIG TX TX CTG GX TX ACT TX CTG ATG OG OG ATG APG GX APG GPA APA CTC ACA CTG PCX TCC AAG AX AX ACA AX CIC TGT OA CAT TTC CTT CCC CPG GCT CTC CPA 1680 

WVSWLACTCLMQQMKRKEKL TLTSKTTTTLCLHYLAQALQ 560 

: f ::; ^GCTCPGCCATO OA APA PPG AX CTT TTC AGT GCA GAT GPC CTC AX AAG CAT GX TTA GAT GX GX ATC 1800 

:h ;;!A QPLGPQLRDLCSLAAEGIW QKKTLFSPDDLRKHGLDGAI 600 

; Uj^ffC TCC AX TIC TTG PPG ATG GGT ATT CTT CPA GPG GPC OX ATC CCT CTG AX TPC AX TTC ATT CAC CIC TGT TTC CPA GPG TTC ITT GCA GCA ATG TX TAT GTC TTG GPG GAT GAG 1920 

jlVllI STFLKMGILQEHPI PLSYS FI HLC FQEFFAAMSYVLEDE 640 

^ i:: \AAS GGG A3A GGT AAA CAT TCT AAT TGC ATC ATA GAT TTG CPA PPG AX CIA GPA GCA TAT GGA ATA CAT GX CTG TTT GGG GCA TCA AX AO\ 0GT TTC CIA TTG GX CIG TTA AGT GAT 2040 

tu!&C GRGKHSNCI I DLEKTLEAY GIHGLFGASTTRFLLGLLSD 680 

^ OG GX GAG AGA GPG ATG GAG PPC ATC TIT CAT TGC GGG CTG TCT CPG GCG AX PPC CTG ATG CPG TGG GIC GGG TGC CTG GPG CTG CTG CTG CPG GCA OC TCT CIG GPG TCC CTC OC 2160 

GEREMENI FHCRLSQGRNL MQWVPSLQLLLQPHSLESLH 720 

jf; PSC TTG TTC GPG ACT GX APC APA AX TTC CIG ACA CPA GIG ATG GX OT TTC GPA GPA ATG GX ATG TGT GTA GPA ACA GPC ATG GPG CTC TTA GIG TX ACT TTC TGC ATT APA TTC 2280 

IsI'Sc LYETRNKTFLTQVMAHFEE MGMCVETDMELLVCT FCI KF 760 

H hkcC GX OC GIG APG PPG CTT OG CTG ATT GPG GX AX OG OC AGA TCA PCA TGG AX OX AX ATG GTA GTC CTG TTC AX TX GIC GCA GTC PCA GAT GX TAT TGG OG ATT CIC 2400 

;L;.S RHVKKLQLI EGRQHRSTWS PTMVVLFRWVPVTDAYWQIL 800 

:; TTC TCC GTC CTC APG GTC AX AGA APC CTG PPG OG OG GPC CTA AGT GGA APC TGG CIG AX GPC TCT GCA GTG AAG AGT CTT TGT PPG AX CIG AGA OX CCT GX TGC CTC CTG GPG 2520 

SFSVLKVTRNLKELDLSGNSX. SHSAVKSLCKTLRRPRCLLE 840 

.p^AX CIG CGG TTG GCT GX TGT GX CTC PCA GCT GPG OC TGC APG GPC CTT GX TTT GX CIG PGA GX APC OG AX CTG AX GPG CTG GAC CIG AX TTC PPT GTG CTC AX GAT GCT 2640 

: *"&T LRLAGCGLTAEDCKDLAFG LRANQTLTELDLSFNVLTDA 880 

yijGGA GX AAA OC CTT TX OG AGA CTG AGA OG OX AX TGC PPG CTA OG CCA CIG OG CIG GTC AX TGT GX CTC AX TCT GPC TGC TGC OG OC CIG GX TCT GTG CTT AGT GX 2760 

~::JG AKHLCQRLRQPSCKLQRLQ LVSCGLTSDCCQDLASVLSA 920 

^LIpGC GX AX CIG AAG GPG CTA GPC CTG OG OG AAC AAC CIG GAT GPC GTT GX GIG CGA CTG CTC TGT GAG GGG CIC AGG CAT CCT QX TX AAA CTC ATA GX CTG CjljjGTG, GAC, (JQ 2890 

l.iS P S L K E LDLQQNNLDDVGVR LLCEGLRH PACKL I RLGLDQ 960 

" .-" ft^ ^G GPA CTG ^TC* CTG CTC ATC TTC AX AGA CGG APA CCA AGT GIG ATG AX CCT ACT GPG GGC CTG GAT AX 3000 

TLSDEMRQELRALEQEKPQ LLI FSRRKPSVMT PTEGLDT 1000 

; . ; : ;GGA OG ATG AGT AAT AX ACA TGC TCA CTC AAG GX OG AGA CIC GCA TCA CAG AX GCG GCT TX CAT GTT GCT OG GCT AAT CTC APA CTC CTG GPC GTG AX APG ATC TTC GCA ATT 3120 

^-"G EMSNSTSSLKRQRLGSERA ASHVAQANLKLLDVSKIFPI 1O40 

GCT GPG ATT GCA GPG GAA AX TCC CCA GPG GTA GIA GGG GIG GAA CTC TTG TX GTG OCT TCT CCT GX TCT CAAGX GPC CTG CAT AX PPG GCT TTG GX ACT GPC GAT GPC TIC TGG 3240 

AEIAEESSPEVVPVELLCVP SPASQGDLHTKPLGTDDDFW 1080 

GX OX AX GX CCT GTG GCT ACT OG GTA GTT OC AAA OA APG APC TIG TPC CGA GTT OC TIC OCT GTA GCT GGC TX TPC OX TGG OX PPC PCG GGT CTC TGC TTT GTG AIG AGA 3360 

GPTGPVATEVVDKEKNLYRV HFPVAGSYRWPNTGLC FVMR 1120 

GPA GGG GTG AX GTT OG ATT GPA TTC TGT GTG TGG GPC OG TTC CTG GGT OG ATC AAC CCA CPG CAC AX TGG ATG GTG GCA GX OCT CTG CTGGPC ATC APG GCT GPG OCT GGA OCT 3480 

EAVTVEI EFCVWDQFLGE I N PQHSWMVAGPLLDI KAE PGA -*1G0 

G3G GPA GCT GTG OC CTC OCT OC TTT GTG GCT CTC CPA GGG QX CAT GTG GPC A3\. TX CTG TTC CPA ATG GX OC TTT PPA OG OG GX ATG CTC CIG OG AAG GCAGX MG GTG 3600 

VEAVHLPHFVALQGGHVDTS LFQMAHFKEEGMLLEKPARV 1200 

OG CTG CAT OC ATA GTT CTG GPA AAC CX AX TTC TX GX TTG GGA GTC CIC CTG PPA ATG ATC Off AAT GX CTG GX TTC ATT OX GTC AX TCT GTG GTG TTG CTT TPC OC OX 3720 

ELHHIVLENPSFSPLGVLLK MI HNALRFI PVTSVVLLYHR 1240 

GTC CAT GCT GPG GPA GIC AX TX CPC CX TAC C3G ATC OCA /GT GPC TGC TGC ATT OGG APG CTC ATA CAT CAT CTA CPA CTG AAA TTT! CTC, TTT CTTG CTA ATC CPT! PPC CTA CTT (TTG 3840 

VHPEEVTFHLYLI PSDCSI R KAI DDLEMKFQFVRIHKPPP 1280 

CTTi PCT. CTA C TT IPC AIG GX TCT QGT TTC PCT GTT? TTT fTH TTT OGT *ITA CTTi AIG CTG CPA ATA CTC QX AAG CAA CTG GAG CTC TGC TAT CGA AX OCT GGA CAA PC OG CTG TTC 3960 

LTPLYMGCRYTVSGSGSGML EILPKELELCYRSPGEDQLF 1320 

TOG GG TTC TTC GTT GX OC TTG GGA TCA QX ATC AX CTG CPA GIG AAA GPC APG AAA GAT GPG ACT CTG GIG TGG GPG GX TTG GIG AAA CCA GGA GAT CTC ATG OCT GCA ACT ACT 4080 

SE FYVGH LGSGIRLQVKDKK DETLVWEALVKPGDLMPATT 1360 

CTG ATC GCT GCAGX GX ATA GX GTA GCT TCA GCT CTG CAT GX GX OG TTG CIG GPC TTT GIG GAC OG TAT GGA GG OG CTG ATA GX GGA GIG ACA TX GTG GPG GET GTC TTG 4200 

LI PPARIAVPSP LDAPQLLH FVDQYREQLIARVTSVEVVL 1400 

OC PPA CIG CAT GGA OG GIG CTG AX OG GPG OG TAC OG AX GIG CTG GCT GAG AAC AX AX CCC AX OG ATG CGG AAG CTG TTC AX TTG AX CAG TCC TX GAC GX AAG TX 4320 

DKLHGQVLSQEQYERVLAEN TRPSQMRKLFSLSQSWDRKC 1440 

AAA OT GGA CTC TTC CPA GX CTG AAG GAG AX CAT GCT OC CTC ATT ATG GAA CTC TGG GAG AAG GX AX AAA AAG GGA CTC CTG GCA CTC AX AX TGA 4422 

KDGLYQALKETHPHLIMELW EKGSKKGLLPLSS * 1473 
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<400> 1 

atg get ggc gga gec tgg ggc cgc erg gec tgt tac ttg gag ttc ctg 48 
Met Ala Gly Gly Ala Trp Gly Arg Leu Ala Cys Tyr Leu GIu Phe Leu 
1 5 10 15 

aag aag gag gag ctg aag gag ttc cag ctt ctg etc gec aat aaa gcg 96 
Lys Lys Glu Glu Leu Lys Glu Phe Gin Leu Leu Leu Ala Asn Lys Ala 
20 25 30 

cac tec agg age tct teg ggt gag aca ccc get cag cca gag aag acg 144 
His Ser Arg Ser Ser Ser Gly Glu Thr Pro Ala Gin Pro Glu Lys Thr 
35 40 45 

agt ggc atg gag gtg gec teg tac ctg gtg get cag tat ggg gag cag 192 
Ser Gly Met Glu Val Ala Ser Tyr Leu Val Ala Gin Tyr Gly Glu Gin 
50 55 60 

egg gee tgg gac eta gec etc cat ace tgg gag cag atg ggg ctg agg 240 
Arg Ala Trp Asp Leu Ala Leu His Thr Trp Glu Gin Met Gly Lea Arg 
65 70 75 80 

tea ctg tgc gec caa gee cag gaa ggg gca ggc cac tct ccc tea ttc 288 
Ser Leu Cys Ala Gin Ala Gin Glu Gly Ala Gly His Ser Pro Ser Phe 
85 90 95 



1 



ccc tac age cca agt gaa ccc cac ctg ggg tct ccc age caa ccc acc 336 
Pro Tyr Ser Pro Ser Glu Pro His Leu Gly Ser Pro Ser Gin Pro Thr 
100 105 110 

tec acc gca gtg eta atg ccc tgg ate cat gaa ttg ccg gcg ggg tgc 384 
Ser Thr Ala Val Leu Met Pro Trp He His Glu Leu Pro Ala Gly Cys 
115 120 125 

acc cag ggc tea gag aga agg gtt ttg aga cag ctg cct gac aca tct 432 
Thr Gin Gly Ser Glu Arg Arg Val Leu Arg Gin Leu Pro Asp Thr Ser 
130 135 140 

gga cgc cgc tgg aga gaa ate tct gec tea etc etc tac caa get ctt 480 
Gly Arg Arg Trp Arg Glu He Ser Ala Ser Leu Leu Tyr Gin Ala Leu 
145 150 155 160 

cca age tec cca gac cat gag tct cca age cag gag tea ccc aac gec 528 
Pro Ser Ser Pro Asp His Glu Ser Pro Ser Gin Glu Ser Fro Asn Ala 
165 170 175 

ccc aca tec aca gca gtg ctg ggg age tgg gga tec cca cct cag ccc 57 6 
Pro Thr Ser Thr Ala Val Leu Gly Ser Trp Gly Ser Pro Pro Gin Pro 
180 185 190 

age eta gca ccc aga gag cag gag gcr cct ggg acc caa tgg cct ctg 624 
Ser Leu Ala Pro Arg Glu Gin Glu Ala Pro Gly Thr Gin Trp Pro Leu 
195 200 205 

gat gaa acg tea gga att tac tac aca gaa ate aga gaa aga gag aga 672 
Asp Glu Thr Ser Gly He Tyr Tyr Thr Glu He Arg Giu Arg Glu Arg 
210 215 220 

gag aaa tea gag aaa ggc agg ccc cca tgg gca gcg gtg gta gga acg 720 
Glu Lys Ser Glu Lys Gly Arg Pro Pro Trp Ala Ala Val Val Gly Thr 
225 230 235 240 

ccc cca cag gcg cac acc age eta cag ccc cac cac cac cca tgg gag 768 
Pro Pro Gin Ala His Thr Ser Leu Gin Pro His His His Pro Trp Glu 
245 250 255 

cct tct gtg aga gag age etc tgt tec aca tgg ccc tgg aaa aat gag 816 
Pro Ser Val Arg Glu Ser Leu Cys Ser Thr Trp Pro Trp Lys Asn Glu 
260 265 270 

gat ttt aac caa aaa ttc aca cag ctg eta ctt eta caa aga cct cac 864 
Asp Phe Asn Gin Lys Phe Thr Gin Leu Leu Leu Leu Gin Arg Pro His 
275 280 285 
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ccc aga age caa gat ccc ctg gtc aag aga age tgg cct gat tat gtg 912 
Pro Arg Ser Gin Asp Pro Leu Val Lys Arg Ser Trp Pro Asp Tyr Val 
290 295 300 

gag gag aat cga gga cat tta att gag ate aga gac tta ttt ggc cca 960 
Glu Glu Asn Arg Gly His Leu He Glu He Arg Asp Leu Phe Gly Pro 
305 310 315 320 

ggc ctg gat ace caa gaa cct cgc ata gtc ata ctg cag ggg get get 1008 
Gly Leu Asp Thr Gin Glu Pro Arg He Val lie Leu Gin Gly Ala Ala 
325 330 335 

gga att ggg aag tea aca ctg gee agg cag gtg aag gaa gee tgg ggg 1056 
Gly lie Gly Lys Ser Thr Leu Ala Arg Gin Val Lys Glu Ala Trp Gly 
340 345 350 

aga ggc cag ctg tat ggg gac cgc ttc cag cat gtc ttc tac ttc age 1104 
Arg Gly Gin Leu Tyr Gly Asp Arg Phe Gin His Val Phe Tyr Phe Ser 
355 360 365 

tgc aga gag ctg gec cag tec aag gtg gtg agt etc get gag etc ate 1152 
Cys Arg Glu Leu Ala Gin Ser Lys Val Val Ser Leu Ala Glu Leu He 
370 375 380 

gga aaa gat ggg aca gee act ccg get ccc att aga cag ate erg tct 1200 
Gly Lys Asp Gly Thr Ala Thr Pro Ala Pro He Arg Gin He Leu Ser 
385 390 395 400 

agg cca gag egg ctg etc ttc ate etc gat ggt gta gat gag cca gga 1248 
Arg Pro Glu Arg Leu Leu Phe He Leu Asp Gly Val Asp Glu Pro Gly 
405 410 415 

tgg gtc ttg cag gag ccg agt tct gag etc tgt ctg cac tgg age cag 1296 
Trp Val Leu Gin Glu Pro Ser Ser Glu Leu Cys Leu His Trp Ser Gin 
420 425 430 

cca cag ccg gcg gat gca ctg ctg ggc agt ttg ctg ggg aaa act ata 1344 
Pro Gin Pro Ala Asp Ala Leu Leu Gly Ser Leu Leu Gly Lys Thr He 
435 440 445 

ctt ccc gag gca tec ttc ctg ate acg get egg acc aca get ctg cag 1392 
Leu Pro Glu Ala Ser Phe Leu He Thr Ala Arg Thr Thr Ala Leu Gin 
450 455 460 

aac etc att cct tct ttg gag cag gca cgt tgg gta gag gtc ctg ggg 1440 
Asn Leu lie Pro Ser Leu Glu Gin Ala Arg Trp Val Glu Val Leu Gly 
465 470 475 480 
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ttc tct gag tec age agg aag gaa tat ttc tac aga tat ttc aca gat 
Phe Ser Glu Ser Ser Arg Lys Glu Tyr Phe Tyr Arg Tyr Phe Thr Asp 
485 490 495 



148£ 



gaa agg caa gca att aga gec ttt agg ttg gtc aaa tea aac aaa gag 
Glu Arg Gin Ala He Arg Ala Phe Arg Leu Val Lys Ser Asn Lys Glu 
500 505 510 



1536 



etc tgg gee ctg tgt ctt gtg ccc tgg gtg tec tgg ctg gec tgc act 
Leu Trp Ala Leu Cys Leu Val Pro Trp Val Ser Trp Leu Ala Cys Thr 
515 520 525 



1584 



tgc ctg atg cag cag atg aag egg aag gaa aaa etc aca ctg act tec 
Cys Leu Met Gin Gin Met Lys Arg Lys Glu Lys Leu Thr Leu Thr Ser 
530 535 540 



1632 



aag acc acc aca acc etc tgt eta cat tac ctt gec cag get etc caa 
Lys Thr Thr Thr Thr Leu Cys Leu His Tyr Leu Ala Gin Ala Leu Gin 
545 550 555 560 



1680 



get cag cca ttg gga ccc cag etc aga gac etc tgc tct ctg get get 
Ala Gin Pro Leu Gly Pro Gin Leu Arg Asp Leu Cys Ser Leu Ala Ala 
565 570 575 



1728 



gag ggc ate tgg caa aaa aag acc ctt ttc agt cca gat gac etc agg 
Glu Gly He Trp Gin Lys Lys Thr Leu Phe Ser Pro Asp Asp Leu Arg 
580 585 590 



1776 



aag cat ggg tta gat ggg gee ate ate tec acc ttc ttg aag atg ggt 
Lys His Gly Leu Asp Gly Ala He lie Ser Thr Phe Leu Lys Met Gly 
595 600 605 



1824 



att ctt caa gag cac ccc ate cct erg age tac age ttc att cac etc 
He Leu Gin Glu His Pro He Pro Leu Ser Tyr Ser Phe He His Leu 
610 615 620 



1872 



tgt ttc caa gag ttc ttt gca gca atg tec tat gtc ttg gag gat gag 
Cys Phe Gin Glu Phe Phe Ala Ala Met Ser Tyr Val Leu Glu Asp Glu 
625 630 635 640 



1920 



aag ggg aga ggt aaa cat tct aat tgc ate ata gat ttg gaa aag acg 
Lys Gly Arg Gly Lys His Ser Asn Cys He He Asp Leu Glu Lys Thr 
645 650 655 



1968 



eta gaa gca tat gga ata cat ggc ctg ttt ggg gca tea acc aca cgt 
Leu Glu Ala Tyr Gly He His Gly Leu Phe Gly Ala Ser Thr Thr Arg 
660 665 670 



2016 
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ttc eta ttg ggc ctg tta agt gat gag ggg gag aga gag atg gag aac 2064 
Phe Leu Leu GLy Leu Leu Ser Asp Glu Gly Glu Arg Glu Met Glu Asn 
675 680 685 

ate ttt cac tgc egg ctg tct cag ggg agg aac ctg atg cag tgg gtc 2112 
lie Phe His Cys Arg Leu Ser Gin Gly Arg Asn Leu Met Gin Trp Val 
690 695 700 

ccg tec ctg cag ctg ctg ctg cag cca cac tct ctg gag tec etc cac 2160 
Pro Ser Leu Gin Leu Leu Leu Gin Pro His Ser Leu Glu Ser Leu His 
705 710 715 720 

tgc ttg tac gag act egg aac aaa acg ttc ctg aca caa gtg atg gee 2208 
Cys Leu Tyr Glu Thr Arg Asn Lys Thr Phe Leu Thr Gin Val Met Ala 
725 730 735 

cat ttc gaa gaa atg ggc atg tgt gta gaa aca gac atg gag etc tta 2256 
His Phe Glu Glu Met Gly Met Cys Val Glu Thr Asp Met Glu Leu Leu 
740 745 750 

gtg tgc act ttc tgc att aaa ttc age cgc cac gtg aag aag ctt cag 2304 
Val Cys Thr Phe Cys He Lys Phe Ser Arg His Val Lys Lys Leu Gin 
755 760 765 

ctg att gag ggc agg cag cac aga tea aca tgg age ccc ace atg gta 2352 
Leu He Glu Gly Arg Gin His Arg Ser Thr Trp Ser Pro Thr Met Val 
770 775 780 

gtc ctg ttc agg tgg gtc cca gtc aca gat gec tat tgg cag att etc 2400 
Val Leu Phe Arg Trp Val Pro Val Thr Asp Ala Tyr Trp Gin He Leu 
785 790 795 800 

ttc tec gtc etc aag gtc acc aga aac ctg aag gag ctg gac eta agt 2448 
Phe Ser Val Leu Lys Val Thr Arg Asn Leu Lys Glu Leu Asp Leu Ser 
805 810 815 

gga aac teg ctg age cac tct gca gtg aag agt ctt tgt aag acc ctg 2496 
Gly Asn Ser Leu Ser His Ser Ala Val Lys Ser Leu Cys Lys Thr Leu 
820 825 830 

aga cgc cct cgc tgc etc ctg gag acc ctg egg ttg get ggc tgt ggc 2544 
Arg Arg Pro Arg Cys Leu Leu Glu Thr Leu Arg Leu Ala Gly Cys Gly 
835 840 845 

etc aca get gag gac tgc aag gac ctt gee ttt ggg ctg aga gee aac 2592 
Leu Thr Ala Glu Asp Cys Lys Asp Leu Ala Phe Gly Leu Arg Ala Asn 
850 855 860 
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cag acc ctg acc gag ctg gac ctg age ttc aat gtg etc acg gat get 2640 
Gin Thr Leu Thr Glu Leu Asp Leu Ser Phe Asn Val Leu Thr Asp Ala 
865 870 875 880 

gga gec aaa cac ctt tgc cag aga ctg aga cag ccg age tgc aag eta 2688 
Gly Ala Lys His Leu Cys Gin Arg Leu Arg Gin Pro Ser Cys Lys Leu 
885 890 895 

cag cga ctg cag ctg gtc age tgt ggc etc acg tct gac tgc tgc cag 2736 
Gin Arg Leu Gin Leu Val Ser Cys Gly Leu Thr Ser Asp Cys Cys Gin 
900 905 910 

gac ctg gec tct gtg ctt agt gec age ccc age ctg aag gag eta gac 2784 
Asp Leu Ala Ser Val Leu Ser Ala Ser Pro Ser Leu Lys Glu Leu Asp 
915 920 925 

ctg cag cag aac aac ctg gat gac gtt ggc gtg cga ctg etc tgt gag 2832 
Leu Gin Gin Asn Asn Leu Asp Asp Val Gly Val Arg Leu Leu Cys Glu 
930 935 940 

ggg etc agg cat cct gee tgc aaa etc ata cgc ctg ggg ctg gac cag 2880 
Gly Leu Arg His Pro Ala Cys Lys Leu lie Arg Leu Gly Leu Asp Gin 
945 950 955 960 

aca act ctg agt gat gag atg agg cag gaa ctg agg gec ctg gag cag 2928 
Thr Thr Leu Ser Asp Glu Met Arg Gin Glu Leu Arg Ala Leu Glu Gin 
965 970 975 

gag aaa cct cag ctg etc ate ttc age aga egg aaa cca agt gtg atg 2976 
Glu Lys Pro Gin Leu Leu lie Phe Ser Arg Arg Lys Pro Ser Val Met 
980 985 990 

acc cct act gag ggc ctg gat acg gga gag atg agt aat age aca tec 3024 
Thr Pro Thr Glu Gly Leu Asp Thr Gly Glu Met Ser Asn Ser Thr Ser 
995 1000 1005 

tea etc aag egg cag aga etc gga tea gag agg gcg get tec cat gtt 3072 
Ser Leu Lys Arg Gin Arg Leu Gly Ser Glu Arg Ala Ala Ser His Val 
1010 1015 1020 

get cag get aat etc aaa etc ctg gac gtg age aag ate ttc cca att 3120 
Ala Gin Ala Asn Leu Lys Leu Leu Asp Val Ser Lys lie Phe Pro lie 
1025 1030 1035 1040 

get gag att gca gag gaa age tec cca gag gta gta ccg gtg gaa etc 3168 
Ala Glu He Ala Glu Glu Ser Ser Pro Glu Val Val Pro Val Glu Leu 
1045 1050 1055 
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ttg tgc gtg cct tct cct gcc tct caa ggg gac ctg cat acg aag cct 3216 
Leu Cys Val Pro Ser Pro Ala Ser Gin Gly Asp Leu His Thr Lys Pro 
1060 1065 1070 

ttg ggg act gac gat gac ttc tgg ggc ccc acg ggg cct gtg get act 3264 
Leu Gly Thr Asp Asp Asp Phe Trp Gly Pro Thr Gly Pro Val Ala Thr 
1075 1080 1085 

gag gta gtt gac aaa gaa aag aac ttg tac cga gtt cac ttc cct gta 3312 
Glu Val Val Asp Lys Glu Lys Asn Leu Tyr Arg Val His Phe Pro Val 
1090 1095 1100 

get ggc tec tac cgc tgg ccc aac acg ggt etc tgc ttt gtg atg aga 3360 
Ala Gly Ser Tyr Arg Trp Pro Asn Thr Gly Leu Cys Phe Val Met Arg 
H05 1110 1115 1120 

gaa gcg gtg acc gtt gag att gaa ttc tgt gtg tgg gac cag ttc ctg 3408 
Glu Ala Val Thr Val Glu He Glu Phe Cys Val Trp Asp Gin Phe Leu 
1125 1130 1135 

ggt gag ate aac cca cag cac age tgg atg gtg gca ggg cct ctg ctg 3456 
Gly Glu He Asn Pro Gin His Ser Trp Met Val Ala Gly Pro Leu Leu 
1140 1145 1150 

gac ate aag get gag cct gga get gtg gaa get gtg cac etc cct cac 3504 
Asp He Lys Ala Glu Pro Gly Ala Val Glu Ala Val His Leu Pro His 
1155 1160 1165 

ttt gtg get etc caa ggg ggc cat gtg gac aca tec ctg ttc caa atg 3552 
Phe Val Ala Leu Gin Gly Gly His Val Asp Thr Ser Leu Phe Gin Met 
1170 1175 1180 

gcc cac ttt aaa gag gag ggg atg etc ctg gag aag cca gcc agg gtg 3600 
Ala His Phe Lys Glu Glu Gly Met Leu Leu Glu Lys Pro Ala Arg Val 
1185 1190 1195 1200 

gag ctg cat cac ata gtt ctg gaa aac ccc age ttc tec ccc ttg gga 3648 
Glu Leu His His He Val Leu Glu Asn Pro Ser Phe Ser Pro Leu Gly 
1205 1210 1215 

gtc etc ctg aaa atg ate cat aat gcc ctg cgc ttc att ccc gtc acc 3696 
Val Leu Leu Lys Met He His Asn Ala Leu Arg Phe He Pro Val Thr 
1220 1225 1230 

tct gtg gtg ttg ctt tac cac cgc gtc cat cct gag gaa gtc acc ttc 3744 
Ser Val Val Leu Leu Tyr His Arg Val His Pro Glu Glu Val Thr Phe 
1235 1240 1245 
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cac etc tac ctg ate cca agt gac tgc tec att egg aag gec ata gat 3792 
His Leu Tyr Leu lie Pro Ser Asp Cys Ser lie Arg Lys Ala He Asp 
1250 1255 1260 

gat eta gaa atg aaa ttc cag ttt gtg cga ate cac aag cca ccc ccg 3840 
Asp Leu Glu Met Lys Phe Gin Phe Val Arg He His Lys Pro Pro Pro 
1265 1270 1275 1280 

ctg acc cca ctt tat atg ggc tgt cgt tac act gtg tct ggg tct ggt 3888 
Leu Thr Pro Leu Tyr Met Gly Cys Arg Tyr Thr Val Ser Gly Ser Gly 
1285 1290 1295 

tea ggg atg ctg gaa ata etc ccc aag gaa ctg gag etc tgc tat cga 3936 
Ser Gly Met Leu Glu He Leu Pro Lys Glu Leu Glu Leu Cys Tyr Arg 
1300 1305 1310 

age cct gga gaa gac cag ctg ttc teg gag ttc tac gtt ggc cac ttg 3984 
Ser Pro Gly Glu Asp Gin Leu Phe Ser Glu Phe Tyr Val Gly His Leu 
1315 1320 1325 

gga tea ggg ate agg ctg caa gtg aaa gac aag aaa gat gag act ctg 4032 
Gly Ser Gly He Arg Leu Gin Val Lys Asp Lys Lys Asp Glu Thr Leu 
1330 1335 1340 

gtg tgg gag gec ttg gtg aaa cca gga gat etc atg cct gca act act 4080 
Val Trp Glu Ala Leu Val Lys Pro Gly Asp Leu Met Pro Ala Thr Thr 
1345 1350 1355 1360 

ctg ate cct cca gec cgc ata gee gta cct tea cct ctg gat gee ccg 4128 
Leu He Pro Pro Ala Arg He Ala Val Pro Ser Pro Leu Asp Ala Pro 
1365 1370 1375 

cag ttg ctg cac ttt gtg gac cag tat cga gag cag ctg ata gee cga 4176 
Gin Leu Leu His Phe Val Asp Gin Tyr Arg Glu Gin Leu He Ala Arg 
1380 1385 1390 

gtg aca teg gtg gag gtt gtc ttg gac aaa ctg cat gga cag gtg ctg 4224 
Val Thr Ser Val Glu Val Val Leu Asp Lys Leu His Gly Gin Val Leu 
1395 1400 1405 

age cag gag cag tac gag agg gtg ctg get gag aac acg agg ccc age 4272 
Ser Gin Glu Gin Tyr Glu Arg Val Leu Ala Glu Asn Thr Arg Pro Ser 
1410 1415 1420 

cag atg egg aag ctg ttc age ttg age cag tec tgg gac egg aag tgc 4320 
Gin Met Arg Lys Leu Phe Ser Leu Ser Gin Ser Trp Asp Arg Lys Cys 
1425 1430 1435 1440 
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aaa gat gga etc tac caa gec ctg aag gag acc cat cct cac etc att 4368 
Lys Asp Gly Leu Tyr Gin Ala Leu Lys Glu Thr His Pro His Leu He 
1445 1450 1455 



atg gaa etc tgg gag aag ggc age aaa aag gga etc ctg cca etc age 4416 
Met Glu Leu Trp Glu Lys Gly Ser Lys Lys Gly Leu Leu Pro Leu Ser 
1460 1465 1470 

age tga 4422 
Ser 



<210> 2 
<211> 1473 
<212> PRT 

<213> Homo sapiens 



<400> 2 

Met Ala Gly Gly 

1 

Lys Lys Glu Glu 

20 

His Ser Arg Ser 
35 

Ser Gly Met Glu 
50 

Arg Ala Trp Asp 
65 

Ser Leu Cys Ala 



Pro Tyr Ser Pro 
100 

Ser Thr Ala Val 
115 

Thr Gin Gly Ser 
130 

Gly Arg Arg Trp 
145 



Ala Trp Gly Arg 
5 

Leu Lys Glu Phe 



Ser Ser Gly Glu 
40 

Val Ala Ser Tyr 
55 

Leu Ala Leu His 
70 

Gin Ala Gin Glu 
85 

Ser Glu Pro His 



Leu Met Pro Trp 
120 

Glu Arg Arg Val 
135 

Arg Glu He Ser 
150 



Leu Ala Cys Tyr 
10 

Gin Leu Leu Leu 

25 

Thr Pro Ala Gin 



Leu Val Ala Gin 
60 

Thr Trp Glu Gin 
75 

Gly Ala Gly His 
90 

Leu Gly Ser Pro 
105 

He His Glu Leu 



Leu Arg Gin Leu 
140 

Ala Ser Leu Leu 
155 



Leu Glu Phe Leu 
15 

Ala Asn Lys Ala 
30 

Pro Glu Lys Thr 
45 

Tyr Gly Glu Gin 



Met Gly Leu Arg 
80 

Ser Pro Ser Phe 
95 

Ser Gin Pro Thr 
110 

Pro Ala Gly Cys 
125 

Pro Asp Thr Ser 



Tyr Gin Ala Leu 
160 
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Pro Ser Ser Pro Asp His Glu Ser Pro Ser Gin Glu Ser Pro Asn Ala 
165 170 175 



Pro Thr Ser Thr Ala Val Leu Gly Ser Trp Gly Ser Pro Pro Gin Pro 
180 185 190 

Ser Leu Ala Pro Arg Glu Gin Glu Ala Pro Gly Thr Gin Trp Pro Leu 
195 200 205 

Asp Glu Thr Ser Gly He Tyr Tyr Thr Glu He Arg Glu Arg Glu Arg 
210 215 220 

Glu Lys Ser Glu Lys Gly Arg Pro Pro Trp Ala Ala Val Val Gly Thr 
225 230 235 240 

Pro Pro Gin Ala His Thr Ser Leu Gin Pro His His His Pro Trp Glu 
245 250 255 

Pro Ser Val Arg Glu Ser Leu Cys Ser Thr Trp Pro Trp Lys Asn Glu 
260 265 270 

Asp Phe Asn Gin Lys Phe Thr Gin Leu Leu Leu Leu Gin Arg Pro His 
275 280 285 

Pro Arg Ser Gin Asp Pro Leu Val Lys Arg Ser Trp Pro Asp Tyr Val 
290 295 300 

Glu Glu Asn Arg Gly His 'Leu He Glu He Arg Asp Leu Phe Gly Pro 
305 310 315 320 

Gly Leu Asp Thr Gin Glu Pro Arg He Val He Leu Gin Gly Ala Ala 
325 330 335 

Gly He Gly Lys Ser Thr Leu Ala Arg Gin Val Lys Glu Ala Trp Gly 
340 345 350 

Arg Gly Gin Leu Tyr Gly Asp Arg Phe Gin His Val Phe Tyr Phe Ser 
355 360 365 

Cys Arg Glu Leu Ala Gin Ser Lys Val Val Ser Leu Ala Glu Leu He 
370 375 380 

Gly Lys Asp Gly Thr Ala Thr Pro Ala Pro He Arg Gin He Leu Ser 
385 390 395 400 



Arg Pro Glu Arg Leu Leu Phe He Leu Asp Gly Val Asp Glu Pro Gly 
405 410 415 
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Trp Val Leu Gin 
420 

Pro Gin Pro Ala 
435 

Leu Pro Glu Ala 
450 

Asn Leu lie Pro 
465 

Phe Ser Glu Ser 



Glu Arg Gin Ala 
500 

Leu Trp Ala Leu 
515 

Cys Leu Met Gin 
530 

Lys Thr Thr Thr 
545 

Ala Gin Pro Leu 



Glu Gly He Trp 
580 

Lys His Gly Leu 
595 

He Leu Gin Glu 
610 

Cys Phe Gin Glu 
625 

Lys Gly Arg Gly 



Leu Glu Ala Tyr 
660 



Glu Pro Ser Ser 



Asp Ala Leu Leu 
440 

Ser Phe Leu He 
455 

Ser Leu Glu Gin 
470 

Ser Arg Lys Glu 
485 

He Arg Ala Phe 



Cys Leu Val Pro 
520 

Gin Met Lys Arg 
535 

Thr Leu Cys Leu 
550 

Gly Pro Gin Leu 
565 

Gin Lys Lys Thr 



Asp Gly Ala He 
600 

His Pro He Pro 
615 

Phe Phe Ala Ala 
630 

Lys His Ser Asn 
645 

Gly He His Gly 



Glu Leu Cys Leu 
425 

Gly Ser Leu Leu 



Thr Ala Arg Thr 
460 

Ala Arg Trp Val 
475 

Tyr Phe Tyr Arg 
490 

Arg Leu Val Lys 
505 

Trp Val Ser Trp 



Lys Glu Lys Leu 
540 

His Tyr Leu Ala 
555 

Arg Asp Leu Cys 
570 

Leu Phe Ser Pro 
585 

He Ser Thr Phe 



Leu Ser Tyr Ser 
620 

Met Ser Tyr Val 
635 

Cys lie He Asp 
650 

Leu Phe Gly Ala 
665 



His Trp Ser Gin 
430 

Gly Lys Thr He 
445 

Thr Ala Leu Gin 



Glu Val Leu Gly 
480 

Tyr Phe Thr Asp 
495 

Ser Asn Lys Glu 
510 

Leu Ala Cys Thr 
525 

Thr Leu Thr Ser 



Gin Ala Leu Gin 
560 

Ser Leu Ala Ala 
575 

Asp Asp Leu Arg 
590 

Leu Lys Met Gly 
605 

Phe He His Leu 



Leu Glu Asp Glu 
640 

Leu Glu Lys Thr 
655 

Ser Thr Thr Arg 
670 
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Phe Leu Leu Gly 
675 

lie Phe His Cys 
690 

Pro Ser Leu Gin 
705 

Cys Leu Tyr Glu 



His Phe Glu Glu 
740 

Val Cys Thr Phe 
755 

Leu lie Glu Gly 

770 

Val Leu Phe Arg 
785 

Phe Ser Val Leu 



Gly Asn Ser Leu 
820 

Arg Arg Pro Arg 
835 

Leu Thr Ala Glu 

850 

Gin Thr Leu Thr 
865 

Gly Ala Lys His 



Gin Arg Leu Gin 
900 

Asp Leu Ala Ser 
915 



Leu Leu Ser Asp 
680 

Arg Leu Ser Gin 
695 

Leu Leu Leu Gin 
710 

Thr Arg Asn Lys 
725 

Met Gly Met Cys 



Cys lie Lys Phe 
760 

Arg Gin His Arg 
775 

Trp Val Pro Val 
790 

Lys Val Thr Arg 
805 

Ser His Ser Ala 



Cys Leu Leu Glu 
840 

Asp Cys Lys Asp 
855 

Glu Leu Asp Leu 

870 

Leu Cys Gin Arg 
885 

Leu Val Ser Cys 



Val Leu Ser Ala 
920 



Glu Gly Glu Arg 



Gly Arg Asn Leu 
700 

Pro His Ser Leu 
715 

Thr Phe Leu Thr 
730 

Val Glu Thr Asp 
745 

Ser Arg His Val 



Ser Thr Trp Ser 
780 

Thr Asp Ala Tyr 
795 

Asn Leu Lys Glu 
810 

Val Lys Ser Leu 

825 

Thr Leu Arg Leu 



Leu Ala Phe Gly 
860 

Ser Phe Asn Val 
875 

Leu Arg Gin Pro 
890 

Gly Leu Thr Ser 
905 

Ser Pro Ser Leu 



Glu Met Glu Asn 
685 

Met Gin Trp Val 



Glu Ser Leu His 

720 

Gin Val Met Ala 
735 

Met Glu Leu Leu 
750 

Lys Lys Leu Gin 
765 

Pro Thr Met Val 



Trp Gin lie Leu 
800 

Leu Asp Leu Ser 
815 

Cys Lys Thr Leu 
830 

Ala Gly Cys Gly 
845 

Leu Arg Ala Asn 



Leu Thr Asp Ala 
880 

Ser Cys Lys Leu 
895 

Asp Cys Cys Gin 
910 

Lys Glu Leu Asp 
925 
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Leu Gin Gin Asn Asn Leu Asp Asp Val Gly Val Arg Leu Leu Cys Glu 
930 935 940 



Gly Leu Arg His Pro Ala Cys Lys Leu He Arg Leu Gly Leu Asp Gin 
945 950 955 960 

Thr Thr Leu Ser Asp Glu Met Arg Gin Glu Leu Arg Ala Leu Glu Gin 
965 970 975 

Glu Lys Pro Gin Leu Leu He Phe Ser Arg Arg Lys Pro Ser Val Met 
980 985 990 

Thr Pro Thr Glu Gly Leu Asp Thr Gly Glu Met Ser Asn Ser Thr Ser 
995 1000 1005 

Ser Leu Lys Arg Gin Arg Leu Gly Ser Glu Arg Ala Ala Ser His Val 
1010 1015 1020 

Ala Gin Ala Asn Leu Lys Leu Leu Asp Val Ser Lys He Phe Pro He 
025 1030 1035 1040 

Ala Glu He Ala Glu Glu Ser Ser Pro Glu Val Val Pro Val Glu Leu 
1045 1050 1055 

Leu Cys Val Pro Ser Pro Ala Ser Gin Gly Asp Leu His Thr Lys Pro 
1060 1065 1070 

Leu Gly Thr Asp Asp Asp Phe Trp Gly Pro Thr Gly Pro Val Ala Thr 
1075 1080 1085 

Glu Val Val Asp Lys Glu Lys Asn Leu Tyr Arg Val His Phe Pro Val 
1090 1095 1100 

Ala Gly Ser Tyr Arg Trp Pro Asn Thr Gly Leu Cys Phe Val Met Arg 
105 1110 1115 1120 

Glu Ala Val Thr Val Glu He Glu Phe Cys Val Trp Asp Gin Phe Leu 
1125 1130 H35 

Gly Glu He Asn Pro Gin His Ser Trp Met Val Ala Gly Pro Leu Leu 
1140 1145 H50 

Asp He Lys Ala Glu Pro Gly Ala Val Glu Ala Val His Leu Pro His 
1155 1160 1165 



Phe Val Ala Leu Gin Gly Gly His Val Asp Thr Ser Leu Phe Gin Met 
1170 1175 H80 
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Ala His Phe Lys Glu Glu Gly Met Leu Leu Glu Lys Pro Ala Arg Val 
185 1190 1195 1200 



Glu Leu His His lie Val Leu Glu Asn Pro Ser Phe Ser Pro Leu Gly 
1205 1210 1215 

Val Leu Leu Lys Met lie His Asn Ala Leu Arg Phe He Pro Val Thr 
1220 1225 1230 

Ser Val Val Leu Leu Tyr His Arg Val His Pro Glu Glu Val Thr Phe 
1235 1240 1245 

His Leu Tyr Leu He Pro Ser Asp Cys Ser He Arg Lys Ala He Asp 
1250 1255 1260 

Asp Leu Glu Met Lys Phe Gin Phe Val Arg He His Lys Pro Pro Pro 
265 1270 1275 1280 

Leu Thr Pro Leu Tyr Met Gly Cys Arg Tyr Thr Val Ser Gly Ser Gly 
1285 1290 1295 

Ser Gly Met Leu Glu He Leu Pro Lys Glu Leu Glu Leu Cys Tyr Arg 
1300 1305 1310 

Ser Pro Gly Glu Asp Gin Leu Phe Ser Glu Phe Tyr Val Gly His Leu 
1315 1320 1325 

Gly Ser Gly He Arg Leu Gin Val Lys Asp Lys Lys Asp Glu Thr Leu 
1330 1335 1340 

Val Trp Glu Ala Leu Val Lys Pro Gly Asp Leu Met Pro Ala Thr Thr 
345 1350 1355 1360 

Leu He Pro Pro Ala Arg He Ala Val Pro Ser Pro Leu Asp Ala Pro 
1365 1370 1375 

Gin Leu Leu His Phe Val Asp Gin Tyr Arg Glu Gin Leu He Ala Arg 
1380 1385 1390 

Val Thr Ser Val Glu Val Val Leu Asp Lys Leu His Gly Gin Val Leu 
1395 1400 1405 

Ser Gin Glu Gin Tyr Glu Arg Val Leu Ala Glu Asn Thr Arg Pro Ser 
1410 1415 1420 

Gin Met Arg Lys Leu Phe Ser Leu Ser Gin Ser Trp Asp Arg Lys Cys 
425 1430 1435 1440 
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Lys Asp Gly Leu Tyr Gin Ala Leu Lys Glu Thr His Pro His Leu He 
1445 1450 1455 

Met Glu Leu Trp Glu Lys Gly Ser Lys Lys Gly Leu Leu Pro Leu Ser 
1460 1465 1470 



<210> 3 
<211> 4200 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (4200) 

<400> 3 

atg get ggc gga gec tgg ggc cgc ctg gec tgt tac ttg gag ttc ctg 48 

Met Ala Gly Gly Ala Trp Gly Arg Leu Ala Cys Tyr Leu Glu Phe Leu 
15 10 15 

aag aag gag gag ctg aag gag ttc cag ctt ctg etc gec aat aaa gcg 96 
Lys Lys Glu Glu Leu Lys Glu Phe Gin Leu Leu Leu Ala Asn Lys Ala 
20 25 30 

cac tec agg age tct teg ggt gag aca ccc get cag cca gag aag acg 144 
His Ser Arg Ser Ser Ser Gly Glu Thr Pro Ala Gin Pro Glu Lys Thr 
35 40 45 

agt ggc atg gag gtg gec teg tac ctg gtg get cag tat ggg gag cag 192 
Ser Gly Met Glu Val Ala Ser Tyr Leu Val Ala Gin Tyr Gly Glu Gin 
50 55 60 

egg gec tgg gac eta gee etc cat acc tgg gag cag atg ggg ctg agg 240 
Arg Ala Trp Asp Leu Ala Leu His Thr Trp Glu Gin Met Gly Leu Arg 
65 70 75 80 

tea ctg tgc gec caa gee cag gaa ggg gca ggc cac tct ccc tea ttc 288 
Ser Leu Cys Ala Gin Ala Gin Glu Gly Ala Gly His Ser Pro Ser Phe 
85 90 95 

ccc tac age cca agt gaa ccc cac ctg ggg tct ccc age caa ccc acc 336 
Pro Tyr Ser Pro Ser Glu Pro His Leu Gly Ser Pro Ser Gin Pro Thr 
100 105 110 
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tec acc gca gtg eta atg ccc tgg ate cat gaa ttg ccg gcg ggg tgc 384 
Ser Thr Ala Val Leu Met Pro Trp lie His Glu Leu Pro Ala Gly Cys 
115 120 125 

acc cag ggc tea gag aga agg gtt ttg aga cag ctg cct gac aca tct 432 
Thr Gin Gly Ser Glu Arg Arg Val Leu Arg Gin Leu Pro Asp Thr Ser 
130 135 140 

gga cgc cgc tgg aga gaa ate tct gec tea etc etc tac caa get ctt 480 
Gly Arg Arg Trp Arg Glu lie Ser Ala Ser Leu Leu Tyr Gin Ala Leu 
145 150 155 160 

cca age tec cca gac cat gag tct cca age cag gag tea ccc aac gee 528 
Pro Ser Ser Pro Asp His Glu Ser Pro Ser Gin Glu Ser Pro Asn Ala 
165 170 175 

ccc aca tec aca gca gtg ctg ggg age tgg gga tec cca cct cag ccc 576 
Pro Thr Ser Thr Ala Val Leu Gly Ser Trp Gly Ser Pro Pro Gin Pro 
180 185 190 

age eta gca ccc aga gag cag gag get cct ggg acc caa tgg cct ctg 624 
Ser Leu Ala Pro Arg Glu Gin Glu Ala Pro Gly Thr Gin Trp Pro Leu 
195 200 205 

gat gaa acg tea gga att tac tac aca gaa ate aga gaa aga gag aga 672 
Asp Glu Thr Ser Gly He Tyr Tyr Thr Glu He Arg Glu Arg Glu Arg 
210 215 220 

gag aaa tea gag aaa ggc agg ccc cca tgg gca gcg gtg gta gga acg 720 
Glu Lys Ser Glu Lys Gly Arg Pro Pro Trp Ala Ala Val Val Gly Thr 
225 230 235 240 

ccc cca cag gcg cac acc age eta cag ccc cac cac cac cca tgg gag 768 
Pro Pro Gin Ala His Thr Ser Leu Gin Pro His His His Pro Trp Glu 
245 250 255 

cct tct gtg aga gag age etc tgt tec aca tgg ccc tgg aaa aat gag 816 
Pro Ser Val Arg Glu Ser Leu Cys Ser Thr Trp Pro Trp Lys Asn Glu 
260 265 270 

gat ttt aac caa aaa ttc aca cag ctg eta ctt eta caa aga cct cac 864 
Asp Phe Asn Gin Lys Phe Thr Gin Leu Leu Leu Leu Gin Arg Pro His 
275 280 285 

ccc aga age caa gat ccc ctg gtc aag aga age tgg cct gat tat gtg 912 
Pro Arg Ser Gin Asp Pro Leu Val Lys Arg Ser Trp Pro Asp Tyr Val 
290 295 300 
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gag gag aat cga gga cat tta att gag ate aga gac tta ttt ggc cca 960 
Glu Glu Asn Arg Gly His Leu lie Glu lie Arg Asp Leu Phe Gly Pro 
305 310 315 320 

ggc ctg gat acc caa gaa cct cgc ata gtc ata ctg cag ggg get get 1008 
Gly Leu Asp Thr Gin Glu Pro Arg He Val He Leu Gin Gly Ala Ala 
325 330 335 

gga att ggg aag tea aca ctg gee agg cag gtg aag gaa gec tgg ggg 1056 
Gly He Gly Lys Ser Thr Leu Ala Arg Gin Val Lys Glu Ala Trp Gly 
340 345 350 

aga ggc cag ctg tat ggg gac cgc ttc cag cat gtc ttc tac ttc age 1104 
Arg Gly Gin Leu Tyr Gly Asp Arg Phe Gin His Val Phe Tyr Phe Ser 
355 360 365 

tgc aga gag ctg gec cag tec aag gtg gtg agt etc get gag etc ate 1152 
Cys Arg Glu Leu Ala Gin Ser Lys Val Val Ser Leu Ala Glu Leu He 
370 375 380 

gga aaa gat ggg aca gee act ccg get ccc att aga cag ate ctg tct 1200 
Gly Lys Asp Gly Thr Ala Thr Pro Ala Pro He Arg Gin He Leu Ser 
385 390 395 400 

agg cca gag egg ctg etc ttc ate etc gat ggt gta gat gag cca gga 1248 
Arg Pro Glu Arg Leu Leu Phe lie Leu Asp Gly Val Asp Glu Pro Gly 
405 410 415 

tgg gtc ttg cag gag ccg agt tct gag etc tgt ctg cac tgg age cag 1296 
Trp Val Leu Gin Glu Pro Ser Ser Glu Leu Cys Leu His Trp Ser Gin 
420 425 430 

cca cag ccg gcg gat gca ctg ctg ggc agt ttg ctg ggg aaa act ata 1344 
Pro Gin Pro Ala Asp Ala Leu Leu Gly Ser Leu Leu Gly Lys Thr He 
435 440 445 

ctt ccc gag gca tec ttc ctg ate acg get egg acc aca get ctg cag 1392 
Leu Pro Glu Ala Ser Phe Leu He Thr Ala Arg Thr Thr Ala Leu Gin 
450 455 460 

aac etc att cct tct ttg gag cag gca cgt tgg gta gag gtc ctg ggg 1440 
Asn Leu He Pro Ser Leu Glu Gin Ala Arg Trp Val Glu Val Leu Gly 
465 470 475 480 

ttc tct gag tec age agg aag gaa tat ttc tac aga tat ttc aca gat 1488 
Phe Ser Glu Ser Ser Arg Lys Glu Tyr Phe Tyr Arg Tyr Phe Thr Asp 
485 490 495 
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gaa agg caa gca att aga gcc ttt agg ttg gtc aaa tea aac aaa gag 1536 
Glu Arg Gin Ala lie Arg Ala Phe Arg Leu Val Lys Ser Asn Lys Glu 
500 505 510 

etc tgg gcc ctg tgt ctt gtg ccc tgg gtg tec tgg ctg gcc tgc act 1584 
Leu Trp Ala Leu Cys Leu Val Pro Trp Val Ser Trp Leu Ala Cys Thr 
515 520 525 

tgc ctg atg cag cag atg aag egg aag gaa aaa etc aca ctg act tec 1632 
Cys Leu Met Gin Gin Met Lys Arg Lys Glu Lys Leu Thr Leu Thr Ser 
530 535 540 

aag acc acc aca ace etc tgt eta cat tac ctt gcc cag get etc caa 1680 
Lys Thr Thr Thr Thr Leu Cys Leu His Tyr Leu Ala Gin Ala Leu Gin 
545 550 555 560 

get cag cca ttg gga ccc cag etc aga gac etc tgc tct ctg get get 1728 
Ala Gin Pro Leu Gly Pro Gin Leu Arg Asp Leu Cys Ser Leu Ala Ala 
565 570 575 

gag ggc ate tgg caa aaa aag acc ctt ttc agt cca gat gac etc agg 1776 
Glu Gly lie Trp Gin Lys Lys Thr Leu Phe Ser Pro Asp Asp Leu Arg 
580 585 590 

aag cat ggg tta gat ggg gcc ate ate tec acc ttc ttg aag atg ggt 1824 
Lys His Gly Leu Asp Gly Ala He He Ser Thr Phe Leu Lys Met Gly 
595 600 605 

att ctt caa gag cac ccc ate cct ctg age tac age ttc att cac etc 1872 
He Leu Gin Glu His Pro He Pro Leu Ser Tyr Ser Phe lie His Leu 
610 615 620 

tgt ttc caa gag ttc ttt gca gca atg tec tat gtc ttg gag gat gag 1920 
Cys Phe Gin Glu Phe Phe Ala Ala Met Ser Tyr Val Leu Glu Asp Glu 
625 630 635 640 

aag ggg aga ggt aaa cat tct aat tgc ate ata gat ttg gaa aag acg 1968 
Lys Gly Arg Gly Lys His Ser Asn Cys He lie Asp Leu Glu Lys Thr 
645 650 655 

eta gaa gca tat gga ata cat ggc ctg ttt ggg gca tea acc aca cgt 2016 
Leu Glu Ala Tyr Gly He His Gly Leu Phe Gly Ala Ser Thr Thr Arg 
660 665 670 

ttc eta ttg ggc ctg tta agt gat gag ggg gag aga gag atg gag aac 2064 
Phe Leu Leu Gly Leu Leu Ser Asp Glu Gly Glu Arg Glu Met Glu Asn 
675 680 685 
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ate ttt cac tgc egg ctg tct cag ggg agg aac ctg atg cag tgg gtc 
lie Phe His Cys Arg Leu Ser Gin Gly Arg Asn Leu Met Gin Trp Val 
690 695 700 



2112 



ccg tec ctg cag ctg ctg ctg cag cca cac tct ctg gag tec etc cac 
Pro Ser Leu Gin Leu Leu Leu Gin Pro His Ser Leu Glu Ser Leu His 
705 710 715 720 



2160 



tgc ttg tac gag act egg aac aaa acg ttc ctg aca caa gtg atg gee 
Cys Leu Tyr Glu Thr Arg Asn Lys Thr Phe Leu Thr Gin Val Met Ala 
725 730 735 



2208 



cat ttc gaa gaa atg ggc atg tgt gta gaa aca gac atg gag etc tta 
His Phe Glu Glu Met Gly Met Cys Val Glu Thr Asp Met Glu Leu Leu 
740 745 750 



2256 



gtg tgc act ttc tgc att aaa ttc age cgc cac gtg aag aag ctt cag 
Val Cys Thr Phe Cys lie Lys Phe Ser Arg His Val Lys Lys Leu Gin 
755 760 765 



2304 



ctg att gag ggc agg cag cac aga tea aca tgg age ccc acc atg gta 
Leu He Glu Gly Arg Gin His Arg Ser Thr Trp Ser Pro Thr Met Val 
77 0 775 780 



2352 



gtc ctg ttc agg tgg gtc cca gtc aca gat gee tat tgg cag att etc 
Val Leu Phe Arg Trp Val Pro Val Thr Asp Ala Tyr Trp Gin He Leu 
785 790 795 



00 



2400 



ttc tec gtc etc aag gtc acc aga aac ctg aag gag ctg gac eta agt 
Phe Ser Val Leu Lys Val Thr Arg Asn Leu Lys Glu Leu Asp Leu Ser 
805 810 815 



2448 



gga aac teg ctg age cac tct gca gtg aag agt ctt tgt aag acc ctg 
Gly Asn Ser Leu Ser His Ser Ala Val Lys Ser Leu Cys Lys Thr Leu 
820 825 830 



2496 



aga cgc cct cgc tgc etc ctg gag acc ctg egg ttg get ggc tgt ggc 
Arg Arg Pro Arg Cys Leu Leu Glu Thr Leu Arg Leu Ala Gly Cys Gly 
835 840 845 



2544 



etc aca get gag gac tgc aag gac ctt gec ttt ggg ctg aga gee aac 
Leu Thr Ala Glu Asp Cys Lys Asp Leu Ala Phe Gly Leu Arg Ala Asn 
850 855 860 



2592 



cag acc ctg acc gag ctg gac ctg age ttc aat gtg etc acg gat get 
Gin Thr Leu Thr Glu Leu Asp Leu Ser Phe Asn Val Leu Thr Asp Ala 
865 870 875 880 



2640 
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gga gcc aaa cac ctt tgc cag aga ctg aga cag ccg age tgc aag eta 2688 
Gly Ala Lys His Leu Cys Gin Arg Leu Arg Gin Pro Ser Cys Lys Leu 
885 890 895 

cag cga ctg cag ctg gtc age tgt ggc etc acg tct gac tgc tgc cag 2736 
Gin Arg Leu Gin Leu Val Ser Cys Gly Leu Thr Ser Asp Cys Cys Gin 
900 905 910 

gac ctg gcc tct gtg ctt agt gcc age ccc age ctg aag gag eta gac 2784 
Asp Leu Ala Ser Val Leu Ser Ala Ser Pro Ser Leu Lys Glu Leu Asp 
915 920 925 

ctg cag cag aac aac ctg gat gac gtt ggc gtg cga ctg etc tgt gag 2832 
Leu Gin Gin Asn Asn Leu Asp Asp Val Gly Val Arg Leu Leu Cys Glu 
930 935 940 

ggg etc agg cat cct gcc tgc aaa etc ata cgc ctg ggg aaa cca agt 2880 
Gly Leu Arg His Pro Ala Cys Lys Leu He Arg Leu Gly Lys Pro Ser 
945 950 955 960 

gtg atg ace cct act gag ggc ctg gat acg gga gag atg agt aat age 2928 
Val Met Thr Pro Thr Glu Gly Leu Asp Thr Gly Glu Met Ser Asn Ser 
965 970 975 

aca tec tea etc aag egg cag aga etc gga tea gag agg gcg get tec 2976 
Thr Ser Ser Leu Lys Arg Gin Arg Leu Gly Ser Glu Arg Ala Ala Ser 
980 985 990 

cat gtt get cag get aat etc aaa etc ctg gac gtg age aag ate ttc 3024 
His Val Ala Gin Ala Asn Leu Lys Leu Leu Asp Val Ser Lys He Phe 
995 1000 1005 

cca att get gag att gca gag gaa age tec cca gag gta gta ccg gtg 3072 
Pro He Ala Glu He Ala Glu Glu Ser Ser Pro Glu Val Val Pro Val 
1010 1015 1020 

gaa etc ttg tgc gtg cct tct cct gcc tct caa ggg gac ctg cat acg 3120 
Glu Leu Leu Cys Val Pro Ser Pro Ala Ser Gin Gly Asp Leu His Thr 
1025 1030 1035 1040 

aag cct ttg ggg act gac gat gac ttc tgg ggc ccc acg ggg cct gtg 3168 
Lys Pro Leu Gly Thr Asp Asp Asp Phe Trp Gly Pro Thr Gly Pro Val 
1045 1050 1055 

get act gag gta gtt gac aaa gaa aag aac ttg tac cga gtt cac ttc 3216 
Ala Thr Glu Val Val Asp Lys Glu Lys Asn Leu Tyr Arg Val His Phe 
1060 1065 1070 



20 



cct gta get ggc tec tac cgc tgg ccc aac acg ggt etc tgc ttt gtg 
Pro Val Ala Gly Ser Tyr Arg Trp Pro Asn Thr Gly Leu Cys Phe Val 
1075 1080 1085 



3264 



atg aga gaa gcg gtg acc gtt gag att gaa ttc tgt gtg tgg gac cag 
Met Arg Glu Ala Val Thr Val Glu He Glu Phe Cys Val Trp Asp Gin 
1090 1095 HOO 



3312 



ttc ctg ggt gag ate aac cca cag cac age tgg atg gtg gca ggg cct 
Phe Leu Gly Glu He Asn Pro Gin His Ser Trp Met Val Ala Gly Pro 
H05 IHO H15 H20 



3360 



ctg ctg gac ate aag get gag cct gga get gtg gaa get gtg cac etc 
Leu Leu Asp He Lys Ala Glu Pro Gly Ala Val Glu Ala Val His Leu 
1125 H30 H35 



3408 



cct cac ttt gtg get etc caa ggg ggc cat gtg gac aca tec ctg ttc 
Pro His Phe Val Ala Leu Gin Gly Gly His Val Asp Thr Ser Leu Phe 
1140 H45 H50 



3456 



caa atg gee cac ttt aaa gag gag ggg atg etc ctg gag aag cca gee 
Gin Met Ala His Phe Lys Glu Glu Gly Met Leu Leu Glu Lys Pro Ala 
1155 H60 H65 



3504 



agg gtg gag ctg cat cac ata gtt ctg gaa aac ccc age ttc tec ccc 
Arg Val Glu Leu His His lie Val Leu Glu Asn Pro Ser Phe Ser Pro 
1170 H75 1180 



3552 



ttg gga gtc etc ctg aaa atg ate cat aat gec ctg cgc ttc att ccc 
Leu Gly Val Leu Leu Lys Met He His Asn Ala Leu Arg Phe He Pro 
1135 1190 H95 1200 



3600 



gtc acc tct gtg gtg ttg ctt tac cac cgc gtc cat cct gag gaa gtc 
Val Thr Ser Val Val Leu Leu Tyr His Arg Val His Pro Glu Glu Val 
1205 1210 1215 



3648 



acc trc cac etc tac ctg ate cca agt gac tgc tec att egg aag gaa 
Thr Phe His Leu Tyr Leu He Pro Ser Asp Cys Ser He Arg Lys Glu 
1220 1225 1230 



3696 



ctg gag etc tgc tat cga age cct gga gaa gac cag ctg ttc teg gag 
Leu Glu Leu Cys Tyr Arg Ser Pro Gly Glu Asp Gin Leu Phe Ser Glu 
1235 1240 1245 



3744 



ttc tac gtt ggc cac ttg gga tea ggg ate agg ctg caa gtg aaa gac 
Phe Tyr Val Gly His Leu Gly Ser Gly He Arg Leu Gin Val Lys Asp 
1250 1255 1260 



3792 
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aag aaa gat gag act ctg gtg tgg gag gcc ttg gtg aaa cca gga gat 
Lys Lys Asp Glu Thr Leu Val Trp Glu Ala Leu Val Lys Pro Gly Asp 
1265 1270 1275 1280 



3840 



etc atg cct gca act act ctg ate cct cca gcc cgc ata gcc gta cct 
Leu Met Pro Ala Thr Thr Leu He Pro Pro Ala Arg He Ala Val Pro 
1285 1290 1295 



3888 



tea cct ctg gat gcc ccg cag ttg ctg cac ttt gtg gac cag tat cga 
Ser Pro Leu Asp Ala Pro Gin Leu Leu His Phe Val Asp Gin Tyr Arg 
1300 1305 1310 



3936 



gag cag ctg ata gcc cga gtg aca teg gtg gag gtt gtc ttg gac aaa 
Glu Gin Leu He Ala Arg Val Thr Ser Val Glu Val Val Leu Asp Lys 
13 15 1320 1325 



3984 



ctg cat gga cag gtg ctg age cag gag cag tac gag agg gtg ctg get 
Leu His Gly Gin Val Leu Ser Gin Glu Gin Tyr Glu Arg Val Leu Ala 
1330 1335 1340 



4032 



gag aac acg agg ccc age cag atg egg aag ctg ttc age ttg age cag 
Glu Asn Thr Arg Pro Ser Gin Met Arg Lys Leu Phe Ser Leu Ser Gin 
!345 1350 1355 i3 60 



4080 



tec tgg gac egg aag tgc aaa gat gga etc tac caa gee ctg aag gag 
Ser Trp Asp Arg Lys Cys Lys Asp Gly Leu Tyr Gin Ala Leu Lys Glu 
1365 1370 1375 



4128 



acc cat cct cac etc att atg gaa etc tgg gag aag ggc age aaa aag 
Thr His Pro His Leu He Met Glu Leu Trp Glu Lys Gly Ser Lys Lys 
1380 1385 1390 



4176 



gga etc ctg cca etc age age tga 
Gly Leu Leu Pro Leu Ser Ser 

1395 1400 



4200 



<210> 4 
<211> 1399 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Ala Gly Gly Ala Trp Gly Arg Leu Ala Cys Tyr Leu Glu Phe Leu 
1 5 io is 

Lys Lys Glu Glu Leu Lys Glu Phe Gin Leu Leu Leu Ala Asn Lys Ala 
20 25 30 
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His Ser Arg Ser Ser Ser Gly Glu Thr Pro Ala Gin Pro Glu Lys Thr 
35 40 45 



Ser Gly Met Glu Val Ala Ser Tyr Leu Val Ala Gin Tyr Gly Glu Gin 
50 55 60 

Arg Ala Trp Asp Leu Ala Leu His Thr Trp Glu Gin Met Gly Leu Arg 
65 70 75 80 

Ser Leu Cys Ala Gin Ala Gin Glu Gly Ala Gly His Ser Pro Ser Phe 
85 90 95 

Pro Tyr Ser Pro Ser Glu Pro His Leu Gly Ser Pro Ser Gin Pro Thr 
100 105 110 

Ser Thr Ala Val Leu Met Pro Trp He His Glu Leu Pro Ala Gly Cys 
115 120 125 

Thr Gin Gly Ser Glu Arg Arg Val Leu Arg Gin Leu Pro Asp Thr Ser 
130 135 140 

Gly Arg Arg Trp Arg Glu He Ser Ala Ser Leu Leu Tyr Gin Ala Leu 
145 150 155 160 

Pro Ser Ser Pro Asp His Glu Ser Pro Ser Gin Glu Ser Pro Asn Ala 
165 170 175 

Pro Thr Ser Thr Ala Val Leu Gly Ser Trp Gly Ser Pro Pro Gin Pro 
180 185 190 

Ser Leu Ala Pro Arg Glu Gin Glu Ala Pro Gly Thr Gin Trp Pro Leu 
195 200 205 

Asp Glu Thr Ser Gly He Tyr Tyr Thr Glu Tie Arg Glu Arg Glu Arg 
210 215 220 

Glu Lys Ser Glu Lys Gly Arg Pro Pro Trp Ala Ala Val Val Gly Thr 
225 230 235 240 

Pro Pro Gin Ala His Thr Ser Leu Gin Pro His His His Pro Trp Glu 
245 250 255 

Pro Ser Val Arg Glu Ser Leu Cys Ser Thr Trp Pro Trp Lys Asn Glu 
260 265 270 



Asp Phe Asn Gin Lys Phe Thr Gin Leu Leu Leu Leu Gin Arg Pro His 
275 280 285 
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Pro Arg Ser Gin Asp 
290 

Glu Glu Asn Arg Gly 
305 

Gly Leu Asp Thr Gin 
325 

Gly He Gly Lys Ser 
340 

Arg Gly Gin Leu Tyr 
355 

Cys Arg Glu Leu Ala 
370 

Gly Lys Asp Gly Thr 
385 

Arg Pro Glu Arg Leu 
405 

Trp Val Leu Gin Glu 
420 

Pro Gin Pro Ala Asp 
435 

Leu Pro Glu Ala Ser 
450 

Asn Leu He Pro Ser 
465 

Phe Ser Glu Ser Ser 
485 

Glu Arg Gin Ala He 
500 

Leu Trp Ala Leu Cys 
515 

Cys Leu Met Gin Gin 
530 



Pro Leu Val Lys Arg Ser 
295 

His Leu He Glu He Arg 

310 315 

Glu Pro Arg He Val He 
330 

Thr Leu Ala Arg Gin Val 
345 

Gly Asp Arg Phe Gin His 
360 

Gin Ser Lys Val Val Ser 
375 

Ala Thr Pro Ala Pro He 
390 395 

Leu Phe He Leu Asp Gly 
410 

Pro Ser Ser Glu Leu Cys 
425 

Ala Leu Leu Gly Ser Leu 
440 

Phe Leu He Thr Ala Arg 
455 

Leu Glu Gin Ala Arg Trp 
470 475 

Arg Lys Glu Tyr Phe Tyr 
490 

Arg Ala Phe Arg Leu Val 
505 

Leu Val Pro Trp Val Ser 
520 

Met Lys Arg Lys Glu Lys 
535 



Trp Pro Asp Tyr Val 
300 

Asp Leu Phe Gly Pro 
320 

Leu Gin Gly Ala Ala 

335 

Lys Glu Ala Trp Gly 

350 

Val Phe Tyr Phe Ser 
365 

Leu Ala Glu Leu He 
380 

Arg Gin He Leu Ser 
400 

Val Asp Glu Pro Gly 
415 

Leu His Trp Ser Gin 
430 

Leu Gly Lys Thr He 
445 

Thr Thr Ala Leu Gin 
460 

Val Glu Val Leu Gly 
480 

Arg Tyr Phe Thr Asp 
495 

Lys Ser Asn Lys Glu 
510 

Trp Leu Ala Cys Thr 
525 

Leu Thr Leu Thr Ser 
540 



24 



Lys Thr Thr Thr Thr Leu Cys Leu His Tyr Leu Ala Gin Ala Leu Gin 
545 550 555 560 

Ala Gin Pro Leu Gly Pro Gin Leu Arg Asp Leu Cys Ser Leu Ala Ala 
565 570 575 

Glu Gly He Trp Gin Lys Lys Thr Leu Phe Ser Pro Asp Asp Leu Arg 
580 585 590 

Lys His Gly Leu Asp Gly Ala He He Ser Thr Phe Leu Lys Met Gly 
595 600 605 

He Leu Gin Glu His Pro He Pro Leu Ser Tyr Ser Phe He His Leu 
610 615 620 

Cys Phe Gin Glu Phe Phe Ala Ala Met Ser Tyr Val Leu Glu Asp Glu 
625 630 635 640 

Lys Gly Arg Gly Lys His Ser Asn Cys He He Asp Leu Glu Lys Thr 
645 650 655 

Leu Glu Ala Tyr Gly He His Gly Leu Phe Gly Ala Ser Thr Thr Arg 
660 665 670 

Phe Leu Leu Gly Leu Leu Ser Asp Glu Gly Glu Arg Glu Met Glu Asn 
675 680 685 

Tie Phe His Cys Arg Leu Ser Gin Gly Arg Asn Leu Met Gin Trp Val 
690 695 700 

Pro Ser Leu Gin Leu Leu Leu Gin Pro His Ser Leu Glu Ser Leu His 
705 710 715 720 

Cys Leu Tyr Glu Thr Arg Asn Lys Thr Phe Leu Thr Gin Val Met Ala 
725 730 735 

His Phe Glu Glu Met Gly Met Cys Val Glu Thr Asp Met Glu Leu Leu 
740 745 750 

Val Cys Thr Phe Cys He Lys Phe Ser Arg His Val Lys Lys Leu Gin 
755 760 765 

Leu He Glu Gly Arg Gin His Arg Ser Thr Trp Ser Pro Thr Met Val 
770 775 780 



Val Leu Phe Arg Trp Val Pro Val Thr Asp Ala Tyr Trp Gin He Leu 
785 790 795 800 



25 



Phe Ser Val Leu 



Gly Asn Ser Leu 
820 

Arg Arg Pro Arg 
835 

Leu Thr Ala Glu 
850 

Gin Thr Leu Thr 
865 

Gly Ala Lys His 



Gin Arg Leu Gin 

900 



Lys Val Thr Arg 
805 

Ser His Ser Ala 



Cys Leu Leu Glu 
840 

Asp Cys Lys Asp 
855 

Glu Leu Asp Leu 
870 

Leu Cys Gin Arg 
885 

Leu Val Ser Cys 



Asn Leu Lys Glu 
810 

Val Lys Ser Leu 
825 

Thr Leu Arg Leu 



Leu Ala Phe Gly 
860 

Ser Phe Asn Val 
875 

Leu Arg Gin Pro 
890 

Gly Leu Thr Ser 
905 



Leu Asp Leu Ser 
815 

Cys Lys Thr Leu 
830 

Ala Gly Cys Gly 
845 

Leu Arg Ala Asn 



Leu Thr Asp Ala 

880 

Ser Cys Lys Leu 
895 

Asp Cys Cys Gin 
910 



Asp Leu Ala Ser Val 
915 

Leu Gin Gin Asn Asn 
930 

Gly Leu Arg His Pro 
945 

Val Met Thr Pro Thr 
965 

Thr Ser Ser Leu Lys 
980 



Leu Ser Ala Ser Pro Ser 
920 

Leu Asp Asp Val Gly Val 
935 

Ala Cys Lys Leu lie Arg 
950 955 

Glu Gly Leu Asp Thr Gly 
970 

Arg Gin Arg Leu Gly Ser 
985 



Leu Lys Glu Leu Asp 
925 

Arg Leu Leu Cys Glu 
940 

Leu Gly Lys Pro Ser 
960 

Glu Met Ser Asn Ser 
975 

Glu Arg Ala Ala Ser 
990 



His Val Ala Gin Ala Asn Leu Lys Leu Leu Asp Val Ser Lys lie Phe 
995 1000 1005 

Pro lie Ala Glu lie Ala Glu Glu Ser Ser Pro Glu Val Val Pro Val 
1010 1015 1020 



Glu Leu Leu Cys Val Pro Ser Pro Ala Ser Gin Gly Asp Leu His Thr 

°25 1030 1035 1040 

Lys Pro Leu Gly Thr Asp Asp Asp Phe Trp Gly Pro Thr Gly Pro Val 

1045 1050 1055 
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Ala Thr Glu Val Val Asp Lys Glu Lys Asn Leu Tyr Arg Val His Phe 
1060 1065 1070 



Pro Val Ala Gly Ser Tyr Arg Trp Pro Asn Thr Gly Leu Cys Phe Val 
1075 1080 1085 

Met Arg Glu Ala Val Thr Val Glu He Glu Phe Cys Val Trp Asp Gin 
1090 1095 HOO 

Phe Leu Gly Glu He Asn Pro Gin His Ser Trp Met Val Ala Gly Pro 
105 1110 1115 1120 

Leu Leu Asp He Lys Ala Glu Pro Gly Ala Val Glu Ala Val His Leu 
1125 1130 1135 

Pro His Phe Val Ala Leu Gin Gly Gly His Val Asp Thr Ser Leu Phe 
1140 1145 1150 

Gin Met Ala His Phe Lys Glu Glu Gly Met Leu Leu Glu Lys Pro Ala 
1155 1160 1165 

Arg Val Glu Leu His His He Val Leu Glu Asn Pro Ser Phe Ser Pro 
1170 1175 1180 

Leu Gly Val Leu Leu Lys Met He His Asn Ala Leu Arg Phe He Pro 
185 1190 1195 1200 

Val Thr Ser Val Val Leu Leu Tyr His Arg Val His Pro Glu Glu Val 
1205 1210 1215 

Thr Phe His Leu Tyr Leu He Pro Ser Asp Cys Ser lie Arg Lys Glu 
1220 1225 1230 

Leu Glu Leu Cys Tyr Arg Ser Pro Gly Glu Asp Gin Leu Phe Ser Glu 
1235 1240 1245 

Phe Tyr Val Gly His Leu Gly Ser Gly He Arg Leu Gin Val Lys Asp 
1250 1255 1260 

Lys Lys Asp Glu Thr Leu Val Trp Glu Ala Leu Val Lys Pro Gly Asp 
265 1270 1275 1280 

Leu Met Pro Ala Thr Thr Leu He Pro Pro Ala Arg He Ala Val Pro 
1285 1290 1295 

Ser Pro Leu Asp Ala Pro Gin Leu Leu His Phe Val Asp Gin Tyr Arg 
1300 1305 1310 
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Glu Gin Leu lie Ala Arg Val Thr Ser Val Glu Val Val Leu Asp Lys 
1315 1320 1325 



Leu His Gly Gin Val Leu Ser Gin Glu Gin Tyr Glu Arg Val Leu Ala 
1330 1335 1340 

Glu Asn Thr Arg Pro Ser Gin Met Arg Lys Leu Phe Ser Leu Ser Gin 
345 1350 1355 1360 

Ser Trp Asp Arg Lys Cys Lys Asp Gly Leu Tyr Gin Ala Leu Lys Glu 
1365 1370 1375 

Thr His Pro His Leu He Met Glu Leu Trp Glu Lys Gly Ser Lys Lys 
1380 1385 1390 

Gly Leu Leu Pro Leu Ser Ser 
1395 



<210> 5 

<211> 4332 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (4332) 

<400> 5 

atg get ggc gga gec tgg ggc cgc ctg gec tgt tac rtg gag ttc ctg 48 
Met Ala Gly Gly Ala Trp Gly Arg Leu Ala Cys Tyr Leu Glu Phe Leu 
15 10 15 

aag aag gag gag ctg aag gag ttc cag ctt ctg etc gec aat aaa gcg 96 
Lys Lys Glu Glu Leu Lys Glu Phe Gin Leu Leu Leu Ala Asn Lys Ala 
20 25 30 

cac tec agg age tct teg ggt gag aca ccc get cag cca gag aag acg 144 
His Ser Arg Ser Ser Ser Gly Glu Thr Pro Ala Gin Pro Glu Lys Thr 
35 40 45 

agt ggc atg gag gtg gec teg tac ctg gtg get cag tat ggg gag cag 192 
Ser Gly Met Glu Val Ala Ser Tyr Leu Val Ala Gin Tyr Gly Glu Gin 
50 55 60 

egg gec tgg gac eta gec etc cat acc tgg gag cag atg ggg ctg agg 240 
Arg Ala Trp Asp Leu Ala Leu His Thr Trp Glu Gin Met Gly Leu Arg 
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65 



70 



75 



80 



tea ctg tgc gec caa gec cag gaa ggg gca ggc cac tct ccc tea ttc 288 
Ser Leu Cys Ala Gin Ala Gin Glu Gly Ala Gly His Ser Pro Ser Phe 
85 90 95 

ccc tac age cca agt gaa ccc cac ctg ggg tct ccc age caa ccc acc 336 
Pro Tyr Ser Pro Ser Glu Pro His Leu Gly Ser Pro Ser Gin Pro Thr 
100 105 110 

tec acc gca gtg eta atg ccc tgg ate cat gaa ttg ccg gcg ggg tgc 384 
Ser Thr Ala Val Leu Met Pro Trp lie His Glu Leu Pro Ala Gly Cys 
115 120 125 

acc cag ggc tea gag aga agg gtt ttg aga cag ctg cct gac aca tct 432 
Thr Gin Gly Ser Glu Arg Arg Val Leu Arg Gin Leu Pro Asp Thr Ser 
130 135 140 

gga cgc cgc tgg aga gaa ate tct gee tea etc etc tac caa get ctt 480 
Gly Arg Arg Trp Arg Glu He Ser Ala Ser Leu Leu Tyr Gin Ala Leu 
145 150 155 160 

cca age tec cca gac cat gag tct cca age cag gag tea ccc aac gec 528 
Pro Ser Ser Pro Asp His Glu Ser Pro Ser Gin Glu Ser Pro Asn Ala 
165 170 175 

ccc aca tec aca gca gtg ctg ggg age tgg gga tec cca cct cag ccc 576 
Pro Thr Ser Thr Ala Val Leu Gly Ser Trp Gly Ser Pro Pro Gin Pro 
180 185 190 

age eta gca ccc aga gag cag gag get cct ggg acc caa tgg cct ctg 624 
Ser Leu Ala Pro Arg Glu Gin Glu Ala Pro Gly Thr Gin Trp Pro Leu 
195 200 205 

gat gaa acg tea gga att tac rac aca gaa ate aga gaa aga gag aga 672 
Asp Glu Thr Ser Gly He Tyr Tyr Thr Glu He A:rg Glu Arg Glu Arg 
210 215 220 

gag aaa tea gag aaa ggc agg ccc cca tgg gca gcg gtg gta gga acg 720 
Glu Lys Ser Glu Lys Gly Arg Pro Pro Trp Ala Ala Val Val Gly Thr 
225 230 235 240 

ccc cca cag gcg cac acc age eta cag ccc cac cac cac cca tgg gag 768 
Pro Pro Gin Ala His Thr Ser Leu Gin Pro His His His Pro Trp Glu 
245 250 255 

cct tct gtg aga gag age etc tgt tec aca tgg ccc tgg aaa aat gag 816 
Pro Ser Val Arg Glu Ser Leu Cys Ser Thr Trp Pro Trp Lys Asn Glu 
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260 



265 



270 



gat ttt aac caa aaa ttc aca cag ctg eta ctt eta caa aga cct cac 
Asp Phe Asn Gin Lys Phe Thr Gin Leu Leu Leu Leu Gin Arg Pro His 
275 280 285 



864 



ccc aga age caa gat ccc ctg gtc aag aga age tgg cct gat tat gtg 
Pro Arg Ser Gin Asp Pro Leu Val Lys Arg Ser Trp Pro Asp Tyr Val 
290 295 300 



912 



gag gag aat cga gga cat tta att gag ate aga gac tta ttt ggc cca 
Glu Glu Asn Arg Gly His Leu lie Glu lie Arg Asp Leu Phe Gly Pro 
305 310 315 320 



960 



ggc ctg gat acc caa gaa cct cgc ata gtc ata ctg cag ggg get get 
Gly Leu Asp Thr Gin Glu Pro Arg He Val He Leu Gin Gly Ala Ala 
325 330 335 



lOOi 



gga att ggg aag tea aca ctg gee agg cag gtg aag gaa gec tgg ggg 
Gly He Gly Lys Ser Thr Leu Ala Arg Gin Val Lys Glu Ala Trp Gly 
340 345 350 



105* 



aga ggc cag ctg tat ggg gac cgc ttc cag cat gtc ttc tac ttc age 
Arg Gly Gin Leu Tyr Gly Asp Arg Phe Gin His Val Phe Tyr Phe Ser 
355 360 365 



1104 



tgc aga gag ctg gee cag tec aag gtg gtg agt etc get gag etc ate 
Cys Arg Glu Leu Ala Gin Ser Lys Val Val Ser Leu Ala Glu Leu lie 
370 375 380 



1152 



gga aaa gat ggg aca gee act ccg get ccc att aga cag ate ctg tct 
Gly Lys Asp Gly Thr Ala Thr Pro Ala Pro He Arg Gin He Leu Ser 
385 390 395 400 



1200 



agg cca gag egg ctg etc ttc ate etc gat ggt gta gat gag cca gga 
Arg Pro Glu Arg Leu Leu Phe He Leu Asp Gly Val Asp Glu Pro Gly 
405 410 415 



1248 



tgg gtc ttg cag gag ccg agt tct gag etc tgt ctg cac tgg age cag 
Trp Val Leu Gin Glu Pro Ser Ser Glu Leu Cys Leu His Trp Ser Gin 
420 425 430 



1296 



cca cag ccg gcg gat gca ctg ctg ggc agt ttg ctg ggg aaa act ata 
Pro Gin Pro Ala Asp Ala Leu Leu Gly Ser Leu Leu Gly Lys Thr He 
435 440 445 



1344 



ctt ccc gag gca tec ttc ctg ate acg get egg acc aca get ctg cag 
Leu Pro Glu Ala Ser Phe Leu He Thr Ala Arg Thr Thr Ala Leu Gin 



1392 



30 



450 455 460 

aac etc att cct tct ttg gag cag gca cgt tgg gta gag gtc ctg ggg 1440 
Asn Leu He Pro Ser Leu Glu Gin Ala Arg Trp Val Glu Val Leu Gly 
465 470 475 480 

ttc tct gag tec age agg aag gaa tat ttc tac aga tat ttc aca gat 1488 
Phe Ser Glu Ser Ser Arg Lys Glu Tyr Phe Tyr Arg Tyr Phe Thr Asp 
485 490 495 

gaa agg caa gca att aga gec ttt agg ttg gtc aaa tea aac aaa gag 1536 
Glu Arg Gin Ala He Arg Ala Phe Arg Leu Val Lys Ser Asn Lys Glu 
500 505 510 

etc tgg gee ctg tgt ctt gtg ccc tgg gtg tec tgg ctg gec tgc act 1584 
Leu Trp Ala Leu Cys Leu Val Pro Trp Val Ser Trp Leu Ala Cys Thr 
515 520 525 

tgc ctg atg cag cag atg aag egg aag gaa aaa etc aca ctg act tec 1632 
Cys Leu Met Gin Gin Met Lys Arg Lys Glu Lys Leu Thr Leu Thr Ser 
530 535 540 

aag ace acc aca ace etc tgt eta cat tac ctt gec cag get etc caa 1680 
Lys Thr Thr Thr Thr Leu Cys Leu His Tyr Leu Ala Gin Ala Leu Gin 
545 550 555 560 

get cag cca ttg gga ccc cag etc aga gac etc tgc tct ctg get get 1728 
Ala Gin Pro Leu Gly Pro Gin Leu Arg Asp Leu Cys Ser Leu Ala Ala 
565 570 575 

gag ggc ate tgg caa aaa aag acc ctt ttc agt cca gat gac etc agg 1776 
Glu Gly He Trp Gin Lys Lys Thr Leu Phe Ser Pro Asp Asp Leu Arg 
580 585 590 

aag cat ggg tta gat ggg gee ate ate tec acc ttc ttg aag atg ggt 1824 
Lys His Gly Leu Asp Gly Ala He lie Ser Thr Phe Leu Lys Met Gly 
595 600 605 

att ctt caa gag cac ccc ate cct ctg age tac age ttc att cac etc 1872 
He Leu Gin Glu His Pro He Pro Leu Ser Tyr Ser Phe He His Leu 
610 615 620 

tgt ttc caa gag ttc ttt gca gca atg tec tat gtc ttg gag gat gag 1920 
Cys Phe Gin Glu Phe Phe Ala Ala Met Ser Tyr Val Leu Glu Asp Glu 
625 630 635 640 

aag ggg aga ggt aaa cat tct aat tgc ate ata gat ttg gaa aag acg 1968 
Lys Gly Arg Gly Lys His Ser Asn Cys He He Asp Leu Glu Lys Thr 
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645 



650 



655 



eta gaa gca tat gga ata cat ggc ctg ttt ggg gca tea acc aca cgt 2016 
Leu Glu Ala Tyr Gly lie His Gly Leu Phe Gly Ala Ser Thr Thr Arg 
660 665 670 

ttc eta ttg ggc ctg tta agt gat gag ggg gag aga gag atg gag aac 2064 
Phe Leu Leu Gly Leu Leu Ser Asp Glu Gly Glu Arg Glu Met Glu Asn 
675 680 685 

ate ttt cac tgc egg ctg tct cag ggg agg aac ctg atg cag tgg gtc 2112 
lie Phe His Cys Arg Leu Ser Gin Gly Arg Asn Leu Met Gin Trp Vai 
690 695 700 

ccg tec ctg cag ctg ctg ctg cag cca cac tct ctg gag tec etc cac 2160 
Pro Ser Leu Gin Leu Leu Leu Gin Pro His Ser Leu Glu Ser Leu His 
705 710 715 720 

tgc ttg tac gag act egg aac aaa acg ttc ctg aca caa gtg atg gee 2208 
Cys Leu Tyr Glu Thr Arg Asn Lys Thr Phe Leu Thr Gin Val Met Ala 
725 730 735 

cat ttc gaa gaa atg ggc atg tgt gta gaa aca gac atg gag etc tta 2256 
His Phe Glu Glu Met Gly Met Cys Val Glu Thr Asp Met Glu Leu Leu 
740 745 750 

gtg tgc act ttc tgc att aaa ttc age cgc cac gtg aag aag ctt cag 2304 
Val Cys Thr Phe Cys lie Lys Phe Ser Arg His Val Lys Lys Leu Gin 
755 760 765 

ctg att gag ggc agg cag cac aga tea aca tgg age ccc acc atg gta 2352 
Leu lie Glu Gly Arg Gin His Arg Ser Thr Trp Ser Pro Thr Met Val 
770 775 780 

gtc ctg ttc agg tgg gtc cca gtc aca gat gec tat tgg cag att etc 2400 
Val Leu Phe Arg Trp Val Pro Val Thr Asp Ala Tyr Trp Gin lie Leu 
785 790 795 800 

ttc tec gtc etc aag gtc acc aga aac ctg aag gag ctg gac eta agt 2448 
Phe Ser Val Leu Lys Val Thr Arg Asn Leu Lys Glu Leu Asp Leu Ser 
805 810 815 

gga aac teg ctg age cac tct gca gtg aag agt ctt tgt aag acc ctg 24 96 
Gly Asn Ser Leu Ser His Ser Ala Val Lys Ser Leu Cys Lys Thr Leu 
820 825 830 

aga cgc cct cgc tgc etc ctg gag acc ctg egg ttg get ggc tgt ggc 2544 
Arg Arg Pro Arg Cys Leu Leu Glu Thr Leu Arg Leu Ala Gly Cys Gly 
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835 



840 



845 



etc aca get gag gac tgc aag gac ctt gec ttt ggg ctg aga gec aac 2592 
Leu Thr Ala Glu Asp Cys Lys Asp Leu Ala Phe Gly Leu Arg Ala Asn 
850 855 860 

cag acc ctg acc gag ctg gac ctg age ttc aat gtg etc acg gat get 2640 
Gin Thr Leu Thr Glu Leu Asp Leu Ser Phe Asn Val Leu Thr Asp Ala 
8^5 870 875 880 

gga gec aaa cac ctt tgc cag aga ctg aga cag ccg age tgc aag eta 2688 
Gly Ala Lys His Leu Cys Gin Arg Leu Arg Gin Pro Ser Cys Lys Leu 
885 890 895 

cag cga ctg cag ctg gtc age tgt ggc etc acg tct gac tgc tgc cag 2736 
Gin Arg Leu Gin Leu Val Ser Cys Gly Leu Thr Ser Asp Cys Cys Gin 
900 905 910 

gac ctg gec tct gtg ctt agt gec age ccc age ctg aag gag eta gac 2784 
Asp Leu Ala Ser Val Leu Ser Ala Ser Pro Ser Leu Lys Glu Leu Asp 
915 920 925 

ctg cag cag aac aac ctg gat gac gtt ggc gtg cga ctg etc tgt gag 2832 
Leu Gin Gin Asn Asn Leu Asp Asp Val Gly Val Arg Leu Leu Cys Glu 
930 935 940 

ggg etc agg cat cct gec tgc aaa etc ata cgc ctg ggg aaa cca agt 2880 
Gly Leu Arg His Pro Ala Cys Lys Leu lie Arg Leu Gly Lys Pro Ser 
945 950 955 960 

gtg atg acc cct act gag ggc ctg gat acg gga gag atg agt aat age 2928 
Val Met Thr Pro Thr Glu Gly Leu Asp Thr Gly Glu Met Ser Asn Ser 
965 970 975 

aca tec tea etc aag egg cag aga etc gga tea gag agg gcg get tec 2976 
Thr Ser Ser Leu Lys Arg Gin Arg Leu Gly Ser Glu Arg Ala Ala Ser 
980 985 990 

cat gtt get cag get aat etc aaa etc ctg gac gtg age aag ate ttc 3024 
His Val Ala Gin Ala Asn Leu Lys Leu Leu Asp Val Ser Lys lie Phe 
995 1000 1005 

cca att get gag att gca gag gaa age tec cca gag gta gta ccg gtg 3072 
Pro lie Ala Glu lie Ala Glu Glu Ser Ser Pro Glu Val Val Pro Val 
1010 1015 1020 

gaa etc ttg tgc gtg cct tct cct gec tct caa ggg gac ctg cat acg 3120 
Glu Leu Leu Cys Val Pro Ser Pro Ala Ser Gin Gly Asp Leu His Thr 



33 



1025 



1030 



1035 



1040 



aag cct ttg ggg act gac gat gac ttc tgg ggc ccc acg ggg cct gtg 3168 
Lys Pro Leu Gly Thr Asp Asp Asp Phe Trp Gly Pro Thr Gly Pro Val 
1045 1050 1055 

get act gag gta gtt gac aaa gaa aag aac ttg tac cga gtt cac ttc 3216 
Ala Thr Glu Val Val Asp Lys Glu Lys Asn Leu Tyr Arg Val His Phe 
1060 1065 1070 

cct gta get ggc tec tac cgc tgg ccc aac acg ggt etc tgc ttt gtg 3264 
Pro Val Ala Gly Ser Tyr Arg Trp Pro Asn Thr Gly Leu Cys Phe Val 
1075 1080 1085 

atg aga gaa gcg gtg acc gtt gag att gaa ttc tgt gtg tgg gac cag 3312 
Met Arg Glu Ala Val Thr Val Glu He Glu Phe Cys Val Trp Asp Gin 
1090 1095 HOO 

ttc ctg ggt gag ate aac cca cag cac age tgg atg gtg gca ggg cct 3360 
Phe Leu Gly Glu He Asn Pro Gin His Ser Trp Met Val Ala Gly Pro 
1105 1110 1H5 1120 

ctg ctg gac ate aag get gag cct gga get gtg gaa get gtg cac etc 3408 
Leu Leu Asp He Lys Ala Glu Pro Gly Ala Val Glu Ala Val His Leu 
H25 H30 H35 

cct cac ttt gtg get etc caa ggg ggc cat gtg gac aca tec ctg ttc 3456 
Pro His Phe Val Ala Leu Gin Gly Gly His Val Asp Thr Ser Leu Phe 
H40 H45 H50 

caa atg gee cac ttt aaa gag gag ggg atg etc ctg gag aag cca gec 3504 
Gin Met Ala His Phe Lys Glu Glu Gly Met Leu Leu Glu Lys Pro Ala 
1155 1160 H65 

agg gtg gag ctg cat cac ata gtt ctg gaa aac ccc age ttc tec ccc 3552 
Arg Val Glu Leu His His He Val Leu Glu Asn Pro Ser Phe Ser Pro 
H70 1175 1180 

ttg gga gtc etc ctg aaa atg ate cat aat gee ctg cgc ttc att ccc 3600 
Leu Gly Val Leu Leu Lys Met He His Asn Ala Leu Arg Phe He Pro 
1185 1190 H95 1200 

gtc acc tct gtg gtg ttg ctt tac cac cgc gtc cat cct gag gaa gtc 3648 
Val Thr Ser Val Val Leu Leu Tyr His Arg Val His Pro Glu Glu Val 
1205 1210 1215 

acc ttc cac etc tac ctg ate cca agt gac tgc tec att egg aag gec 3696 
Thr Phe His Leu Tyr Leu He Pro Ser Asp Cys Ser He Arg Lys Ala 



34 



1220 



1225 



1230 



ata gat gat eta gaa atg aaa ttc cag ttt gtg cga ate cac aag cca 3744 
lie Asp Asp Leu Glu Met Lys Phe Gin Phe Val Arg lie His Lys Pro 
1235 1240 1245 

ccc ccg ctg acc cca ctt tat atg ggc tgt cgt tac act gtg tct ggg 3792 
Pro Pro Leu Thr Pro Leu Tyr Met Gly Cys Arg Tyr Thr Val Ser Gly 
1250 1255 1260 

tct ggt tea ggg atg ctg gaa ata etc ccc aag gaa ctg gag etc tgc 3840 
Ser Gly Ser Gly Met Leu Glu He Leu Pro Lys Glu Leu Glu Leu Cys 
1265 1270 1275 1280 

tat cga age cct gga gaa gac cag ctg ttc teg gag ttc tac gtt ggc 3888 
Tyr Arg Ser Pro Gly Glu Asp Gin Leu Phe Ser Glu Phe Tyr Val Gly 
1285 1290 1295 

cac ttg gga tea ggg ate agg ctg caa gtg aaa gac aag aaa gat gag 3936 
His Leu Gly Ser Gly He Arg Leu Gin Val Lys Asp Lys Lys Asp Glu 
1300 1305 1310 

act ctg gtg tgg gag gee ttg gtg aaa cca gga gat etc atg cct gca 3984 
Thr Leu Val Trp Glu Ala Leu Val Lys Pro Gly Asp Leu Met Pro Ala 
1315 1320 1325 

act act ctg ate cct cca gec cgc ata gee gta cct tea cct ctg gat 4032 
Thr Thr Leu He Pro Pro Ala Arg He Ala Val Pro Ser Pro Leu Asp 
1330 1335 1340 

gee ccg cag ttg ctg cac ttt gtg gac cag tat cga gag cag ctg ata 4080 
Ala Pro Gin Leu Leu His Phe Val Asp Gin Tyr Arg Glu Gin Leu He 
1345 1350 1355 1360 

gee cga gtg aca teg gtg gag gtt gtc ttg gac aaa ctg cat gga cag 4128 
Ala Arg Val Thr Ser Val Glu Val Val Leu Asp Lys Leu His Gly Gin 
1365 1370 1375 

gtg ctg age cag gag cag tac gag agg gtg ctg get gag aac acg agg 4176 
Val Leu Ser Gin Glu Gin Tyr Glu Arg Val Leu Ala Glu Asn Thr Arg 
1380 1385 1390 

ccc age cag atg egg aag ctg ttc age ttg age cag tec tgg gac egg 4224 
Pro Ser Gin Met Arg Lys Leu Phe Ser Leu Ser Gin Ser Trp Asp Arg 
1395 1400 1405 

aag tgc aaa gat gga etc tac caa gee ctg aag gag acc cat cct cac 4272 
Lys Cys Lys Asp Gly Leu Tyr Gin Ala Leu Lys Glu Thr His Pro His 
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1410 



1415 



1420 



etc att atg gaa etc tgg gag aag ggc age aaa aag gga etc ctg cca 
Leu He Met Glu Leu Trp Glu Lys Gly Ser Lys Lys Gly Leu Leu Pro 
1425 1430 1435 1440 



4320 



etc age age tga 
Leu Ser Ser 



4332 



<210> 6 
<211> 1443 
<212> PRT 

<213> Homo sapiens 



<400> 6 

Met Ala Gly Gly 
1 

Lys Lys Glu Glu 
20 

His Ser Arg Ser 
•35 

Ser Gly Met Glu 
50 



Ala Trp Gly Arg 
5 

Leu Lys Glu Phe 



Ser Ser Gly Glu 
40 

Val Ala Ser Tyr 
55 



Leu Ala Cys Tyr 
10 

Gin Leu Leu Leu 
25 

Thr Pro Ala Gin 



Leu Val Ala Gin 
60 



Leu Glu Phe Leu 
15 

Ala Asn Lys Ala 
30 

Pro Glu Lys Thr 
45 

Tyr Gly Glu Gin 



Arg Ala Trp Asp 
65 

Ser Leu Cys Ala 



Pro Tyr Ser Pro 
100 

Ser Thr Ala Val 
115 

Thr Gin Gly Ser 
130 

Gly Arg Arg Trp 
145 

Pro Ser Ser Pro 



Leu Ala Leu His 
70 

Gin Ala Gin Glu 
85 

Ser Glu Pro His 



Leu Met Pro Trp 
120 

Glu Arg Arg Val 
135 

Arg Glu He Ser 
150 

Asp His Glu Ser 
165 



Thr Trp Glu Gin 
75 

Gly Ala Gly His 
90 

Leu Gly Ser Pro 
105 

He His Glu Leu 



Leu Arg Gin Leu 
140 

Ala Ser Leu Leu 
155 

Pro Ser Gin Glu 
170 



Met Gly Leu Arg 
80 

Ser Pro Ser Phe 
95 

Ser Gin Pro Thr 
110 

Pro Ala Gly Cys 
125 

Pro Asp Thr Ser 



Tyr Gin Ala Leu 
160 

Ser Pro Asn Ala 
175 
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Pro Thr Ser Thr 
180 

Ser Leu Ala Pro 
195 

Asp Glu Thr Ser 
210 

Glu Lys Ser Glu 
225 

Pro Pro Gin Ala 



Pro Ser Val Arg 
260 

Asp Phe Asn Gin 
275 

Pro Arg Ser Gin 
290 

Glu Glu Asn Arg 
305 

Gly Leu Asp Thr 



Gly lie Gly Lys 
340 

Arg Gly Gin Leu 
355 

Cys Arg Glu Leu 
370 

Gly Lys Asp Gly 
385 

Arg Pro Glu Arg 



Trp Val Leu Gin 
420 



Ala Val Leu Gly 



Arg Glu Gin Glu 
200 

Gly lie Tyr Tyr 
215 

Lys Gly Arg Pro 
230 

His Thr Ser Leu 
245 

Glu Ser Leu Cys 



Lys Phe Thr Gin 
280 

Asp Pro Leu Val 
295 

Gly His Leu lie 
310 

Gin Glu Pro Arg 
325 

Ser Thr Leu Ala 



Tyr Gly Asp Arg 
360 

Ala Gin Ser Lys 
375 

Thr Ala Thr Pro 
390 

Leu Leu Phe lie 
405 

Glu Pro Ser Ser 



Ser Trp Gly Ser 
185 

Ala Pro Gly Thr 



Thr Glu lie Arg 
220 

Pro Trp Ala Ala 
235 

Gin Pro His His 
250 

Ser Thr Trp Pro 
265 

Leu Leu Leu Leu 



Lys Arg Ser Trp 
300 

Glu lie Arg Asp 
315 

lie Val He Leu 
330 

Arg Gin Val Lys 
345 

Phe Gin His Val 



Val Val Ser Leu 
380 

Ala Pro He Arg 
395 

Leu Asp Gly Val 
410 

Glu Leu Cys Leu 
425 



Pro Pro Gin Pro 
190 

Gin Trp Pro Leu 
205 

Glu Arg Glu Arg 



Val Val Gly Thr 
240 

His Pro Trp Glu 
255 

Trp Lys Asn Glu 
270 

Gin Arg Pro His 
285 

Pro Asp Tyr Val 



Leu Phe Gly Pro 
320 

Gin Gly Ala Ala 
335 

Glu Ala Trp Gly 
350 

Phe Tyr Phe Ser 
365 

Ala Glu Leu He 



Gin He Leu Ser 
400 

Asp Glu Pro Gly 
415 

His Trp Ser Gin 
430 
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Pro Gin Pro Ala Asp Ala Leu Leu Gly Ser Leu Leu Gly Lys Thr lie 
435 440 445 



Leu Pro Glu Ala Ser Phe Leu lie Thr Ala Arg Thr Thr Ala Leu Gin 
450 455 460 

Asn Leu lie Pro Ser Leu Glu Gin Ala Arg Trp Val Glu Val Leu Gly 
465 470 475 480 

Phe Ser Glu Ser Ser Arg Lys Glu Tyr Phe Tyr Arg Tyr Phe Thr Asp 
485 490 495 

Glu Arg Gin Ala lie Arg Ala Phe Arg Leu Val Lys Ser Asn Lys Glu 
500 505 510 

Leu Trp Ala Leu Cys Leu Val Pro Trp Val Ser Trp Leu Ala Cys Thr 
515 520 525 

Cys Leu Met Gin Gin Met Lys Arg Lys Glu Lys Leu Thr Leu Thr Ser 
530 535 540 

Lys Thr Thr Thr Thr Leu Cys Leu His Tyr Leu Ala Gin Ala Leu Gin 
545 550 555 560 

Ala Gin Pro Leu Gly Pro Gin Leu Arg Asp Leu Cys Ser Leu Ala Ala 
565 570 575 

Glu Gly lie Trp Gin Lys Lys Thr Leu Phe Ser Pro Asp Asp Leu Arg 
580 585 590 

Lys His Gly Leu Asp Gly Ala lie lie Ser Thr Phe Leu Lys Met Gly 
595 600 605 

lie Leu Gin Glu His Pro He Pro Leu Ser Tyr Ser Phe He His Leu 
610 615 620 

Cys Phe Gin Glu Phe Phe Ala Ala Met Ser Tyr Val Leu Glu Asp Glu 
625 630 635 640 

Lys Gly Arg Gly Lys His Ser Asn Cys He He Asp Leu Glu Lys Thr 
645 650 655 

Leu Glu Ala Tyr Gly He His Gly Leu Phe Gly Ala Ser Thr Thr Arg 
660 665 670 



Phe Leu Leu Gly Leu Leu Ser Asp Glu Gly Glu Arg Glu Met Glu Asn 
675 680 685 
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lie Phe His Cys Arg Leu Ser Gin Gly Arg Asn Leu Met Gin Trp Val 
690 695 700 



Pro Ser Leu Gin Leu Leu Leu Gin Pro His Ser Leu Glu Ser Leu His 
705 710 715 720 

Cys Leu Tyr Glu Thr Arg Asn Lys Thr Phe Leu Thr Gin Val Met Ala 
725 730 735 

His Phe Glu Glu Met Gly Met Cys Val Glu Thr Asp Met Glu Leu Leu 
740 745 750 

Val Cys Thr Phe Cys He Lys Phe Ser Arg His Val Lys Lys Leu Gin 
755 760 765 

Leu He Glu Gly Arg Gin His Arg Ser Thr Trp Ser Pro Thr Met Val 
770 775 780 

Val Leu Phe Arg Trp Val Pro Val Thr Asp Ala Tyr Trp Gin He Leu 
785 790 795 800 

Phe Ser Val Leu Lys Val Thr Arg Asn Leu Lys Glu Leu Asp Leu Ser 
805 810 815 

Gly Asn Ser Leu Ser His Ser Ala Val Lys Ser Leu Cys Lys Thr Leu 
820 825 830 

Arg Arg Pro Arg Cys Leu Leu Glu Thr Leu Arg Leu Ala Gly Cys Gly 
835 840 845 

Leu Thr Ala Glu Asp Cys Lys Asp Leu Ala Phe Gly Leu Arg Ala Asn 
850 855 860 

Gin Thr Leu Thr Glu Leu Asp Leu Ser Phe Asn Val Leu Thr Asp Ala 
865 870 875 880 

Gly Ala Lys His Leu Cys Gin Arg Leu Arg Gin Pro Ser Cys Lys Leu 
885 890 895 

Gin Arg Leu Gin Leu Val Ser Cys Gly Leu Thr Ser Asp Cys Cys Gin 
900 905 910 

Asp Leu Ala Ser Val Leu Ser Ala Ser Pro Ser Leu Lys Glu Leu Asp 
915 920 925 



Leu Gin Gin Asn Asn Leu Asp Asp Val Gly Val Arg Leu Leu Cys Glu 
930 935 940 
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Gly Leu Arg His Pro Ala Cys Lys Leu He Arg Leu Gly Lys Pro Ser 
945 950 955 960 

Val Met Thr Pro Thr Glu Gly Leu Asp Thr Gly Glu Met Ser Asn Ser 
965 970 975 

Thr Ser Ser Leu Lys Arg Gin Arg Leu Gly Ser Glu Arg Ala Ala Ser 
980 985 990 

His Val Ala Gin Ala Asn Leu Lys Leu Leu Asp Val Ser Lys lie Phe 
995 1000 1005 

Pro He Ala Glu He Ala Glu Glu Ser Ser Pro Glu Val Val Pro Val 
1010 1015 1020 

Glu Leu Leu Cys Val Pro Ser Pro Ala Ser Gin Gly Asp Leu His Thr 
025 1030 1035 1040 

Lys Pro Leu Gly Thr Asp Asp Asp Phe Trp Gly Pro Thr Gly Pro Val 
1°45 1050 1055 

Ala Thr Glu Val Val Asp Lys Glu Lys Asn Leu Tyr Arg Val His Phe 
1060 1065 1070 

Pro Val Ala Gly Ser Tyr Arg Trp Pro Asn Thr Gly Leu Cys Phe Val 
1075 1080 1085 

Met Arg Glu Ala Val Thr Val Glu He Glu Phe Cys Val Trp Asp Gin 
1090 1095 HOO 

Phe Leu Gly Glu He Asn Pro Gin His Ser Trp Met Val Ala Gly Pro 
105 mo ins H20 

Leu Leu Asp He Lys Ala Glu Pro Gly Ala Val Glu Ala Val His Leu 
1125 H30 H35 

Pro His Phe Val Ala Leu Gin Gly Gly His Val Asp Thr Ser Leu Phe 
1140 H45 H50 

Gin Met Ala His Phe Lys Glu Glu Gly Met Leu Leu Glu Lys Pro Ala 
1155 H60 1165 

Arg Val Glu Leu His His He Val Leu Glu Asn Pro Ser Phe Ser Pro 
11^0 1175 H80 



Leu Gly Val Leu Leu Lys Met He His Asn Ala Leu Arg Phe He Pro 
185 H90 H95 1200 
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Val Thr Ser Val Val Leu Leu Tyr His Arg Val His Pro Glu Glu Val 
1205 1210 1215 



Thr Phe His Leu Tyr Leu He Pro Ser Asp Cys Ser He Arg Lys Ala 
1220 1225 1230 

He Asp Asp Leu Glu Met Lys Phe Gin Phe Val Arg He His Lys Pro 
1235 1240 1245 

Pro Pro Leu Thr Pro Leu Tyr Met Gly Cys Arg Tyr Thr Val Ser Gly 
1250 1255 1260 

Ser Gly Ser Gly Met Leu Glu He Leu Pro Lys Glu Leu Glu Leu Cys 
265 1270 1275 1280 

Tyr Arg Ser Pro Gly Glu Asp Gin Leu Phe Ser Glu Phe Tyr Val Gly 
1285 1290 1295 

His Leu Gly Ser Gly He Arg Leu Gin Val Lys Asp Lys Lys Asp Glu 
1300 1305 1310 

Thr Leu Val Trp Glu Ala Leu Val Lys Pro Gly Asp Leu Met Pro Ala 
1315 1320 1325 

Thr Thr Leu He Pro Pro Ala Arg He Ala Val Pro Ser Pro Leu Asp 
1330 1335 1340 

Ala Pro Gin Leu Leu His Phe Val Asp Gin Tyr Arg Glu Gin Leu He 
345 1350 1355 1360 

Ala Arg Val Thr Ser Val Glu Val Val Leu Asp Lys Leu His Gly Gin 
1365 1370 1375 

Val Leu Ser Gin Glu Gin Tyr Glu Arg Val Leu Ala Glu Asn Thr Arg 
1380 1385 1390 

Pro Ser Gin Met Arg Lys Leu Phe Ser Leu Ser Gin Ser Trp Asp Arg 
1395 1400 1405 

Lys Cys Lys Asp Gly Leu Tyr Gin Ala Leu Lys Glu Thr His Pro His 
1410 1415 1420 

Leu He Met Glu Leu Trp Glu Lys Gly Ser Lys Lys Gly Leu Leu Pro 
425 1430 1435 1440 

Leu Ser Ser 
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<210> 7 

<211> 1487 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (1296) 

<400> 7 

atg atg aga cag agg cag age cat tat tgt tec gtg ctg ttc ctg agt 48 

Met Met Arg Gin Arg Gin Ser His Tyr Cys Ser Val Leu Phe Leu Ser 

1 5 10 15 

gtc aac tat ctg ggg ggg aca ttc cca gga gac att tgc tea gaa gag 96 
Val Asn Tyr Leu Gly Gly Thr Phe Pro Gly Asp He Cys Ser Glu Glu 
20 25 30 

aat caa ata gtt tec tct tat get tct aaa gtc tgt ttt gag ate gaa 144 
Asn Gin He Val Ser Ser Tyr Ala Ser Lys Val Cys Phe Glu He Glu 
35 40 45 

gaa gat tat aaa aat cgt cag ttt ctg ggg cct gaa gga aat gtg gat 192 
Glu Asp Tyr Lys Asn Arg Gin Phe Leu Gly Pro Glu Gly Asn Val Asp 
50 55 60 

gtt gag ttg att gat aag age aca aac aga tac age gtt tgg ttc ccc 240 
Val Glu Leu lie Asp Lys Ser Thr Asn Arg Tyr Ser Val Trp Phe Pro 
65 70 75 80 

act get ggc tgg tat ctg tgg tea gec aca ggc etc ggc ttc ctg gta 288 
Thr Ala Gly Trp Tyr Leu Trp Ser Ala Thr Gly Leu Gly Phe Leu Val 
85 90 95 

agg gat gag gtc aca gtg acg att gcg ttt ggt tec tgg agt cag cac 336 
Arg Asp Glu Val Thr Val Thr He Ala Phe Gly Ser Trp Ser Gin His 
100 105 HO 

ctg gec ctg gac ctg cag cac cat gaa cag tgg ctg gtg ggc ggc ccc 384 
Leu Ala Leu Asp Leu Gin His His Glu Gin Trp Leu Val Gly Gly Pro 
115 120 125 

ttg ttt gat gtc act gca gag cca gag gag get gtc gec gaa ate cac 432 
Leu Phe Asp Val Thr Ala Glu Pro Glu Glu Ala Val Ala Glu He His 
130 135 140 

etc ccc cac ttc ate tec etc caa ggt gag gtg gac gtc tec tgg ttt 480 
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Leu Pro His Phe lie Ser Leu Gin Gly Glu Val Asp Val Ser Trp Phe 
145 150 155 160 



etc gtt gee cat ttt aag aat gaa ggg atg gtc ctg gag cat cca gec 528 
Leu Val Ala His Phe Lys Asn Glu Gly Met Val Leu Glu His Pro Ala 
165 170 175 

egg gtg gag cct ttc tat get gtc ctg gaa age ccc age ttc tct ctg 576 
Arg Val Glu Pro Phe Tyr Ala Val Leu Glu Ser Pro Ser Phe Ser Leu 
180 185 190 

atg ggc ate ctg ctg egg ate gee agt ggg act cgc etc tec ate ccc 624 
Met Gly He Leu Leu Arg He Ala Ser Gly Thr Arg Leu Ser He Pro 
195 200 205 

ate act tec aac aca ttg ate tat tat cac ccc cac ccc gaa gat att 672 
He Thr Ser Asn Thr Leu He Tyr Tyr His Pro His Pro Glu Asp He 
210 215 220 

aag ttc cac ttg tac ctt gtc ccc age gac gec ttg eta aca aag gcg 720 
Lys Phe His Leu Tyr Leu Val Pro Ser Asp Ala Leu Leu Thr Lys Ala 
225 230 235 240 

ata gat gat gag gaa gat cgc ttc cat ggt gtg cgc ctg cag act teg 768 
He Asp Asp Glu Glu Asp Arg Phe His Gly Val Arg Leu Gin Thr Ser 
245 250 255 

ccc cca atg gaa ccc ctg aac ttt ggt tec agt tat att gtg tct aat 816 
Pro Pro Met Glu Pro Leu Asn Phe Gly Ser Ser Tyr He Val Ser Asn 
260 265 270 

tct get aac ctg aaa gta atg ccc aag gag ttg aaa ttg tec tac agg 864 
Ser Ala Asn Leu Lys Val Met Pro Lys Glu Leu Lys Leu Ser Tyr Arg 
275 280 285 

age cct gga gaa att cag cac ttc tea aaa ttc tat get ggg cag atg 912 
Ser Pro Gly Glu He Gin His Phe Ser Lys Phe Tyr Ala Gly Gin Met 
290 295 300 

aag gaa ccc att caa ctt gag att act gaa aaa aga cat ggg act ttg 960 
Lys Glu Pro He Gin Leu Glu He Thr Glu Lys Arg His Gly Thr Leu 
305 310 315 320 

gtg tgg gat act gag gtg aag cca gtg gat etc cag ctt gta get gca 1008 
Val Trp Asp Thr Glu Val Lys Pro Val Asp Leu Gin Leu Val Ala Ala 
325 330 335 

tea gee cct cct cct ttc tea ggt gca gec ttt gtg aag gag aac cac 1056 



43 



Ser Ala Pro Pro Pro Phe Ser Gly Ala Ala Phe Val Lys Glu Asn His 
340 345 350 



egg caa etc caa gec agg atg ggg gac ctg aaa ggg gtg etc gat gat 1104 
Arg Gin Leu Gin Ala Arg Met Gly Asp Leu Lys Gly Val Leu Asp Asp 
355 360 365 

etc cag gac aat gag gtt ctt act gag aat gag aag gag ctg gtg gag 1152 
Leu Gin Asp Asn Glu Val Leu Thr Glu Asn Glu Lys Glu Leu Val Glu 
370 375 380 

cag gaa aag aca egg cag age aag aat gag gee ttg ctg age atg gtg 1200 
Gin Glu Lys Thr Arg Gin Ser Lys Asn Glu Ala Leu Leu Ser Met Val 
385 390 395 400 

gag aag aaa ggg gac ctg gec ctg gac gtg etc ttc aga age att agt 1248 
Glu Lys Lys Gly Asp Leu Ala Leu Asp Val Leu Phe Arg Ser lie Ser 
405 410 415 

gaa agg gac cct tac etc gtg tec tat ctt aga cag cag aat ttg taa 1296 
Glu Arg Asp Pro Tyr Leu Val Ser Tyr Leu Arg Gin Gin Asn Leu 
420 425 430 

aatgagtcag ttaggtagtc tggaagagag aatccagcgt tctcattgga aatggataaa 1356 

cagaaatgtg atcattgatt tcagtgttca agacagaaga agactgggta acatctatca 1416 

cacaggcttt caggacagac ttgtaacctg gcatgtacct attgactgta tcctcatgca 1476 

ttttcctcaa g 1487 



<210> 8 
<211> 431 
<212> PRT 

<213> Homo sapiens 



<400> 8 

Met Met Arg Gin 
1 

Val Asn Tyr Leu 
20 

Asn Gin lie Val 
35 

Glu Asp Tyr Lys 



Arg Gin Ser His 
5 

Gly Gly Thr Phe 

Ser Ser Tyr Ala 
40 

Asn Arg Gin Phe 



Tyr Cys Ser Val 
10 

Pro Gly Asp lie 
25 

Ser Lys Val Cys 



Leu Gly Pro Glu 



Leu Phe Leu Ser 
15 

Cys Ser Glu Glu 
30 

Phe Glu lie Glu 
45 

Gly Asn Val Asp 
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50 



55 



60 



Val Glu Leu lie Asp Lys Ser Thr Asn Arg Tyr Ser Val Trp Phe Pro 
65 70 75 80 

Thr Ala Gly Trp Tyr Leu Trp Ser Ala Thr Gly Leu Gly Phe Leu Val 
85 90 95 

Arg Asp Glu Val Thr Val Thr He Ala Phe Gly Ser Trp Ser Gin His 
100 105 HO 

Leu Ala Leu Asp Leu Gin His His Glu Gin Trp Leu Val Gly Gly Pro 
115 120 125 

Leu Phe Asp Val Thr Ala Glu Pro Glu Glu Ala Val Ala Glu He His 
130 135 140 

Leu Pro His Phe He Ser Leu Gin Gly Glu Val Asp Val Ser Trp Phe 
145 150 155 160 

Leu Val Ala His Phe Lys Asn Glu Gly Met Val Leu Glu His Pro Ala 
165 170 175 

Arg Val Glu Pro Phe Tyr Ala Val Leu Glu Ser Pro Ser Phe Ser Leu 
180 185 190 

Met Gly He Leu Leu Arg He Ala Ser Gly Thr Arg Leu Ser He Pro 
195 200 205 

He Thr Ser Asn Thr Leu He Tyr Tyr His Pro His Pro Glu Asp He 
210 215 220 

Lys Phe His Leu Tyr Leu Val Pro Ser Asp Ala Leu Leu Thr Lys Ala 
225 230 235 240 

He Asp Asp Glu Glu Asp Arg Phe His Gly Val Arg Leu Gin Thr Ser 
245 250 255 

Pro Pro Met Glu Pro Leu Asn Phe Gly Ser Ser Tyr He Val Ser Asn 
260 265 270 

Ser Ala Asn Leu Lys Val Met Pro Lys Glu Leu Lys Leu Ser Tyr Arg 
275 280 285 

Ser Pro Gly Glu He Gin His Phe Ser Lys Phe Tyr Ala Gly Gin Met 
290 295 300 



Lys Glu Pro lie Gin Leu Glu He Thr Glu Lys Arg His Gly Thr Leu 



45 



305 

Val Trp Asp Thr 



Ser Ala Pro Pro 
340 

Arg Gin Leu Gin 
355 

Leu Gin Asp Asn 
370 

Gin Glu Lys Thr 
385 

Glu Lys Lys Gly 



Glu Arg Asp Pro 
420 



310 

Glu Val Lys Pro 
325 

Pro Phe Ser Gly 



Ala Arg Met Gly 
360 

Glu Val Leu Thr 

375 

Arg Gin Ser Lys 
390 

Asp Leu Ala Leu 
405 

Tyr Leu Val Ser 



315 

Val Asp Leu Gin 
330 

Ala Ala Phe Val 
345 

Asp Leu Lys Gly 



Glu Asn Glu Lys 
380 

Asn Glu Ala Leu 
395 

Asp Val Leu Phe 
410 

Tyr Leu Arg Gin 
425 



320 

Leu Val Ala Ala 
335 

Lys Glu Asn His 
350 

Val Leu Asp Asp 
365 

Glu Leu Val Glu 



Leu Ser Met Val 
400 

Arg Ser lie Ser 
415 

Gin Asn Leu 
430 



<210> 9 
<211> 4556 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (4365) 

<220> 

<223> Description of Artificial Sequence: Synthetic 
Construct 

<400> 9 

atg get ggc gga gec tgg ggc cgc ctg gec tgt tac ttg gag ttc ctg 48 

Met Ala Gly Gly Ala Trp Gly Arg Leu Ala Cys Tyr Leu Glu Phe Leu 
1 5 10 15 

aag aag gag gag ctg aag gag ttc cag ctt ctg etc gec aat aaa gcg 96 
Lys Lys Glu Glu Leu Lys Glu Phe Gin Leu Leu Leu Ala Asn Lys Ala 
20 25 30 

cac tec agg age tct teg ggt gag aca ccc get cag cca gag aag acg 144 
His Ser Arg Ser Ser Ser Gly Glu Thr Pro Ala Gin Pro Glu Lys Thr 
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35 



40 



45 



agt ggc atg gag gtg gcc teg tac ctg gtg get cag tat ggg gag cag 192 
Ser Gly Met Glu Val Ala Ser Tyr Leu Val Ala Gin Tyr Gly Glu Gin 
50 55 60 

egg gcc tgg gac eta gcc etc cat ace tgg gag cag atg ggg ctg agg 240 
Arg Ala Trp Asp Leu Ala Leu His Thr Trp Glu Gin Met Gly Leu Arg 
65 70 75 80 

tea ctg tgc gcc caa gcc cag gaa ggg gca ggc cac tct ccc tea ttc 288 
Ser Leu Cys Ala Gin Ala Gin Glu Gly Ala Gly His Ser Pro Ser Phe 
85 90 95 

ccc tac age cca agt gaa ccc cac ctg ggg tct ccc age caa ccc acc 336 
Pro Tyr Ser Pro Ser Glu Pro His Leu Gly Ser Pro Ser Gin Pro Thr 
100 105 110 

tec acc gca gtg eta atg ccc tgg ate cat gaa ttg ccg gcg ggg tgc 384 
Ser Thr Ala Val Leu Met Pro Trp lie His Glu Leu Pro Ala Gly Cys 
115 120 125 

acc cag ggc tea gag aga agg gtt ttg aga cag ctg cct gac aca tct 432 
Thr Gin Gly Ser Glu Arg Arg Val Leu Arg Gin Leu Pro Asp Thr Ser 
130 135 140 

gga cgc cgc tgg aga gaa ate tct gcc tea etc etc tac caa get ctt 480 
Gly Arg Arg Trp Arg Glu He Ser Ala Ser Leu Leu Tyr Gin Ala Leu 
145 150 155 160 

cca age tec cca gac cat gag tct cca age cag gag tea ccc aac gcc 528 
Pro Ser Ser Pro Asp His Glu Ser Pro Ser Gin Glu Ser Pro Asn Ala 
165 170 175 

ccc aca tec aca gca gtg ctg ggg age tgg gga tec cca cct cag ccc 576 
Pro Thr Ser Thr Ala Val Leu Gly Ser Trp Gly Ser Pro Pro Gin Pro 
180 185 190 

age eta gca ccc aga gag cag gag get cct ggg acc caa tgg cct ctg 624 
Ser Leu Ala Pro Arg Glu Gin Glu Ala Pro Gly Thr Gin Trp Pro Leu 
195 200 205 

gat gaa acg tea gga att tac tac aca gaa ate aga gaa aga gag aga 672 
Asp Glu Thr Ser Gly He Tyr Tyr Thr Glu He Arg Glu Arg Glu Arg 
210 215 220 

gag aaa tea gag aaa ggc agg ccc cca tgg gca gcg gtg gta gga acg 720 
Glu Lys Ser Glu Lys Gly Arg Pro Pro Trp Ala Ala Val Val Gly Thr 
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225 

ccc cca cag gcg 
Pro Pro Gin Ala 



cct tct gtg aga 
Pro Ser Val Arg 
260 

gat ttt aac caa 
Asp Phe Asn Gin 
275 

ccc aga age caa 
Pro Arg Ser Gin 
290 

gag gag aat cga 
Glu Glu Asn Arg 
305 

ggc ctg gat acc 
Gly Leu Asp Thr 



gga att ggg aag 
Gly lie Gly Lys 
340 

aga ggc cag ctg 
Arg Gly Gin Leu 
355 

tgc aga gag ctg 
Cys Arg Glu Leu 
370 

gga aaa gat ggg 
Gly Lys Asp Gly 
385 

agg cca gag egg 
Arg Pro Glu Arg 



tgg gtc ttg cag 
Trp Val Leu Gin 



230 

cac acc age eta cag 
His Thr Ser Leu Gin 
245 

gag age etc tgt tec 
Glu Ser Leu Cys Ser 
265 

aaa ttc aca cag ctg 
Lys Phe Thr Gin Leu 
280 

gat ccc ctg gtc aag 
Asp Pro Leu Val Lys 
295 

gga cat tta att gag 
Gly His Leu lie Glu 
310 

caa gaa cct cgc ata 
Gin Glu Pro Arg He 
325 

tea aca ctg gec agg 
Ser Thr Leu Ala Arg 
345 

tat ggg gac cgc ttc 
Tyr Gly Asp Arg Phe 
360 

gec cag tec aag gtg 
Ala Gin Ser Lys Val 
375 

aca gec act ccg get 
Thr Ala Thr Pro Ala 

390 

ctg etc ttc ate etc 
Leu Leu Phe He Leu 
405 

gag ccg agt tct gag 
Glu Pro Ser Ser Glu 



235 

ccc cac cac cac 
Pro His His His 
250 

aca tgg ccc tgg 
Thr Trp Pro Trp 

eta ctt eta caa 
Leu Leu Leu Gin 
285 

aga age tgg cct 
Arg Ser Trp Pro 
300 

ate aga gac tta 
He Arg Asp Leu 
315 

gtc ata ctg cag 
Val He Leu Gin 
330 

cag gtg aag gaa 
Gin Val Lys Glu 



cag cat gtc ttc 
Gin His Val Phe 
365 

gtg agt etc get 
Val Ser Leu Ala 
380 

ccc att aga cag 
Pro He Arg Gin 
395 

gat ggt gta gat 
Asp Gly Val Asp 
410 

etc tgt ctg cac 
Leu Cys Leu His 



240 

cca tgg gag 768 
Pro Trp Glu 
255 

aaa aat gag 816 

Lys Asn Glu 

270 

aga cct cac 8 64 
Arg Pro His 



gat tat gtg 912 
Asp Tyr Val 



ttt ggc cca 960 
Phe Gly Pro 
320 

ggg get get 1008 
Gly Ala Ala 
335 

gec tgg ggg 1056 

Ala Trp Gly 

350 

tac ttc age 1104 
Tyr Phe Ser 



gag etc ate 1152 
Glu Leu He 



ate ctg tct 1200 
He Leu Ser 
400 

gag cca gga 1248 
Glu Pro Gly 
415 

tgg age cag 1296 
Trp Ser Gin 



48 



420 



425 



430 



cca cag ccg gcg gat gca ctg ctg ggc agt ttg ctg ggg aaa act ata 
Pro Gin Pro Ala Asp Ala Leu Leu Gly Ser Leu Leu Gly Lys Thr lie 
435 440 445 



1344 



ctt ccc gag gca tec ttc ctg ate acg get egg ace aca get ctg cag 
Leu Pro Glu Ala Ser Phe Leu lie Thr Ala Arg Thr Thr Ala Leu Gin 
450 455 460 



1392 



aac etc att cct tct ttg gag cag gca cgt tgg gta gag gtc ctg ggg 
Asn Leu lie Pro Ser Leu Glu Gin Ala Arg Trp Val Glu Val Leu Gly 
465 470 475 480 



1440 



ttc tct gag tec age agg aag gaa tat ttc tac aga tat ttc aca gat 
Phe Ser Glu Ser Ser Arg Lys Glu Tyr Phe Tyr Arg Tyr Phe Thr Asp 
485 490 495 



1488 



gaa agg caa gca att aga gee ttt agg ttg gtc aaa tea aac aaa gag 
Glu Arg Gin Ala lie Arg Ala Phe Arg Leu Val Lys Ser Asn Lys Glu 
500 505 510 



1536 



etc tgg gee ctg tgt ctt gtg ccc tgg gtg tec tgg ctg gec tgc act 
Leu Trp Ala Leu Cys Leu Val Pro Trp Val Ser Trp Leu Ala Cys Thr 
515 520 525 



1584 



tgc ctg atg cag cag atg aag egg aag gaa aaa etc aca ctg act tec 
Cys Leu Met Gin Gin Met Lys Arg Lys Glu Lys Leu Thr Leu Thr Ser 
530 535 540 



1632 



aag acc acc aca acc etc tgt eta cat tac ctt gee cag get etc caa 
Lys Thr Thr Thr Thr Leu Cys Leu His Tyr Leu Ala Gin Ala Leu Gin 
545 550 555 560 



1680 



get cag cca ttg gga ccc cag etc aga gac etc tgc tct ctg get get 
Ala Gin Pro Leu Gly Pro Gin Leu Arg Asp Leu Cys Ser Leu Ala Ala 
565 570 575 



1728 



gag ggc ate tgg caa aaa aag acc ctt ttc agt cca gat gac etc agg 
Glu Gly lie Trp Gin Lys Lys Thr Leu Phe Ser Pro Asp Asp Leu Arg 
580 585 590 



1776 



aag cat ggg tta gat ggg gec ate ate tec acc ttc ttg aag atg ggt 
Lys His Gly Leu Asp Gly Ala lie lie Ser Thr Phe Leu Lys Met Gly 
595 600 605 



1824 



att ctt caa gag cac ccc ate cct ctg age tac age ttc att cac etc 
lie Leu Gin Glu His Pro lie Pro Leu Ser Tyr Ser Phe lie His Leu 



1872 



49 



610 615 620 

tgt ttc caa gag ttc ttt gca gca atg tec tat gtc ttg gag gat gag 1920 
Cys Phe Gin Glu Phe Phe Ala Ala Met Ser Tyr Val Leu Glu Asp Glu 
625 630 635 640 

sag ggg aga ggt aaa cat tct aat tgc ate ata gat ttg gaa aag acg 1968 
Lys Gly Arg Gly Lys His Ser Asn Cys lie lie Asp Leu Glu Lys Thr 
645 650 655 

eta gaa gca tat gga ata cat ggc ctg ttt ggg gca tea ace aca cgt 2016 
Leu Glu Ala Tyr Gly lie His Gly Leu Phe Gly Ala Ser Thr Thr Arg 
660 665 670 

ttc eta ttg ggc ctg tta agt gat gag ggg gag aga gag atg gag aac 2064 
Phe Leu Leu Gly Leu Leu Ser Asp Glu Gly Glu Arg Glu Met Glu Asn 
675 680 685 

ate ttt cac tgc egg ctg tct cag ggg agg aac ctg atg cag tgg gtc 2112 
lie Phe His Cys Arg Leu Ser Gin Gly Arg Asn Leu Met Gin Trp Val 
690 695 700 

ccg tec ctg cag ctg ctg ctg cag cca cac tct ctg gag tec etc cac 2160 
Pro Ser Leu Gin Leu Leu Leu Gin Pro His Ser Leu Glu Ser Leu His 
705 710 715 720 

tgc ttg tac gag act egg aac aaa acg ttc ctg aca caa gtg atg gee 2208 
Cys Leu Tyr Glu Thr Arg Asn Lys Thr Phe Leu Thr Gin Val Met Ala 
725 730 735 

cat ttc gaa gaa atg ggc atg tgt gta gaa aca gac atg gag etc tta 2256 
His Phe Glu Glu Met Gly Met Cys Val Glu Thr Asp Met Glu Leu Leu 
740 745 750 

gtg tgc act ttc tgc att aaa ttc age cgc cac gtg aag aag ctt cag 2304 
Val Cys Thr Phe Cys lie Lys Phe Ser Arg His Val Lys Lys Leu Gin 
755 760 765 

ctg att gag ggc agg cag cac aga tea aca tgg age ccc ace atg gta 2352 
Leu lie Glu Gly Arg Gin His Arg Ser Thr Trp Ser Pro Thr Met Val 
770 775 780 

gtc ctg ttc agg tgg gtc cca gtc aca gat gec tat tgg cag att etc 2400 
Val Leu Phe Arg Trp Val Pro Val Thr Asp Ala Tyr Trp Gin lie Leu 
785 790 795 800 

ttc tec gtc etc aag gtc ace aga aac ctg aag gag ctg gac eta agt 2448 
Phe Ser Val Leu Lys Val Thr Arg Asn Leu Lys Glu Leu Asp Leu Ser 



50 



805 



810 



815 



gga aac teg ctg age cac tct gca gtg aag agt ctt tgt aag acc ctg 24 96 
Gly Asn Ser Leu Ser His Ser Ala Val Lys Ser Leu Cys Lys Thr Leu 
820 825 830 

aga cgc cct cgc tgc etc ctg gag acc ctg egg ttg get ggc tgt ggc 2544 
Arg Arg Pro Arg Cys Leu Leu Glu Thr Leu Arg Leu Ala Gly Cys Gly 
835 840 845 

etc aca get gag gac tgc aag gac ctt gec ttt ggg ctg aga gee aac 2592 
Leu Thr Ala Glu Asp Cys Lys Asp Leu Ala Phe Gly Leu Arg Ala Asn 
850 855 860 

cag acc ctg acc gag ctg gac ctg age ttc aat gtg etc acg gat get 2640 
Gin Thr Leu Thr Glu Leu Asp Leu Ser Phe Asn Val Leu Thr Asp Ala 
865 870 875 880 

gga gee aaa cac ctt tgc cag aga ctg aga cag ccg age tgc aag eta 2688 
Gly Ala Lys His Leu Cys Gin Arg Leu Arg Gin Pro Ser Cys Lys Leu 
885 890 895 

cag cga ctg cag ctg gtc age tgt ggc etc acg tct gac tgc tgc cag 2736 
Gin Arg Leu Gin Leu Val Ser Cys Gly Leu Thr Ser Asp Cys Cys Gin 
900 905 910 

gac ctg gec tct gtg ctt agt gee age ccc age ctg aag gag eta gac 2784 
Asp Leu Ala Ser Val Leu Ser Ala Ser Pro Ser Leu Lys Glu Leu Asp 
915 920 925 

ctg cag cag aac aac ctg gat gac gtt ggc gtg cga ctg etc tgt gag 2832 
Leu Gin Gin Asn Asn Leu Asp Asp Val Gly Val Arg Leu Leu Cys Glu 
930 935 940 

ggg etc agg cat cct gee tgc aaa etc ata cgc ctg ggg ctg gac cag 2880 
Gly Leu Arg His Pro Ala Cys Lys Leu lie Arg Leu Gly Leu Asp Gin 
945 950 955 960 

aca act ctg agt gat gag atg agg cag gaa ctg agg gec ctg gag cag 2928 
Thr Thr Leu Ser Asp Glu Met Arg Gin Glu Leu Arg Ala Leu Glu Gin 
965 970 975 

gag aaa cct cag ctg etc ate ttc age aga egg aaa cca agt gtg atg 2976 
Glu Lys Pro Gin Leu Leu lie Phe Ser Arg Arg Lys Pro Ser Val Met 
980 985 990 

acc cct act gag ggc ctg gat acg gga gag atg agt aat age aca tec 3024 
Thr Pro Thr Glu Gly Leu Asp Thr Gly Glu Met Ser Asn Ser Thr Ser 
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995 



1000 



1005 



tea etc aag egg cag aga etc gga tea gag agg gcg get tec cat gtt 3072 
Ser Leu Lys Arg Gin Arg Leu Gly Ser Glu Arg Ala Ala Ser His Val 
1010 1015 1020 

get cag get aat etc aaa etc ctg gac gtg age aag ate ttc cca att 3120 
Ala Gin Ala Asn Leu Lys Leu Leu Asp Val Ser Lys lie Phe Pro lie 
1025 1030 1035 1040 

get gag att gca gag gaa age tec cca gag gta gta ccg gtg gaa etc 3168 
Ala Glu lie Ala Glu Glu Ser Ser Pro Glu Val Val Pro Val Glu Leu 
1045 1050 1055 

ttg tgc gtg cct tct cct gec tct caa ggg gac ctg cat acg aag cct 3216 
Leu Cys Val Pro Ser Pro Ala Ser Gin Gly Asp Leu His Thr Lys Pro 
1060 1065 1070 

ttg ggg act gac gat gac ttt ctg ggg cct gaa gga aat gtg gat gtt 3264 
Leu Gly Thr Asp Asp Asp Phe Leu Gly Pro Glu Gly Asn Val Asp Val 
1075 1080 1085 

gag ttg att gat aag age aca aac aga tac age gtt tgg ttc ccc act 3312 
Glu Leu lie Asp Lys Ser Thr Asn Arg Tyr Ser Val Trp Phe Pro Thr 
1090 1095 1100 

get ggc tgg tat ctg tgg tea gee aca ggc etc ggc ttc ctg gta agg 3360 
Ala Gly Trp Tyr Leu Trp Ser Ala Thr Gly Leu Gly Phe Leu Val Arg 
1105 1110 1115 1120 

gat gag gtc aca gtg acg att gcg ttt ggt tec tgg agt cag cac ctg 3408 
Asp Glu Val Thr Val Thr lie Ala Phe Gly Ser Trp Ser Gin His Leu 
1125 1130 1135 

gec ctg gac ctg cag cac cat gaa cag tgg ctg gtg ggc ggc ccc ttg 3456 
Ala Leu Asp Leu Gin His His Glu Gin Trp Leu Val Gly Gly Pro Leu 
1140 1145 1150 

ttt gat gtc act gca gag cca gag gag get gtc gee gaa ate cac etc 3504 
Phe Asp Val Thr Ala Glu Pro Glu Glu Ala Val Ala Glu He His Leu 
1155 1160 1165 

ccc cac ttc ate tec etc caa ggt gag gtg gac gtc tec tgg ttt etc 3552 
Pro His Phe He Ser Leu Gin Gly Glu Val Asp Val Ser Trp Phe Leu 
1170 1175 1180 

gtt gee cat ttt aag aat gaa ggg atg gtc ctg gag cat cca gee egg 3600 
Val Ala His Phe Lys Asn Glu Gly Met Val Leu Glu His Pro Ala Arg 
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1185 



1190 



1195 



1200 



gtg gag cct ttc tat get gtc ctg gaa age ccc age ttc tct ctg atg 3648 
Val Glu Pro Phe Tyr Ala Val Leu Glu Ser Pro Ser Phe Ser Leu Met 
1205 1210 1215 

ggc ate ctg ctg egg ate gee agt ggg act cgc etc tec ate ccc ate 3696 
Gly lie Leu Leu Arg lie Ala Ser Gly Thr Arg Leu Ser He Pro He 
1220 1225 1230 

act tec aac aca ttg ate tat tat cac ccc cac ccc gaa gat att aag 3744 
Thr Ser Asn Thr Leu He Tyr Tyr His Pro His Pro Glu Asp He Lys 
1235 1240 1245 

ttc cac ttg tac ctt gtc ccc age gac gec ttg eta aca aag gcg ata 3792 
Phe His Leu Tyr Leu Val Pro Ser Asp Ala Leu Leu Thr Lys Ala He 
1250 1255 1260 

gat gat gag gaa gat cgc ttc cat ggt gtg cgc ctg cag act teg ccc 3840 
Asp Asp Glu Glu Asp Arg Phe His Gly Val Arg Leu Gin Thr Ser Pro 
1265 1270 1275 1280 

cca atg gaa ccc ctg aac ttt ggt tec agt tat att gtg tct aat tct 3888 
Pro Met Glu Pro Leu Asn Phe Gly Ser Ser Tyr He Val Ser Asn Ser 
1285 1290 1295 

get aac ctg aaa gta atg ccc aag gag ttg aaa ttg tec tac agg age 3936 
Ala Asn Leu Lys Val Met Pro Lys Glu Leu Lys Leu Ser Tyr Arg Ser 
1300 1305 1310 

cct gga gaa att cag cac ttc tea aaa ttc tat get ggg cag atg aag 3984 
Pro Gly Glu He Gin His Phe Ser Lys Phe Tyr Ala Gly Gin Met Lys 
1315 1320 1325 

gaa ccc att caa ctt gag att act gaa aaa aga cat ggg act ttg gtg 4032 
Glu Pro He Gin Leu Glu He Thr Glu Lys Arg His Gly Thr Leu Val 
1330 1335 1340 

tgg gat act gag gtg aag cca gtg gat etc cag ctt gta get gca tea 4080 
Trp Asp Thr Glu Val Lys Pro Val Asp Leu Gin Leu Val Ala Ala Ser 
1345 1350 1355 1360 

gec cct cct cct ttc tea ggt gca gec ttt gtg aag gag aac cac egg 4128 
Ala Pro Pro Pro Phe Ser Gly Ala Ala Phe Val Lys Glu Asn His Arg 
1365 1370 1375 

caa etc caa gec agg atg ggg gac ctg aaa ggg gtg etc gat gat etc 4176 
Gin Leu Gin Ala Arg Met Gly Asp Leu Lys Gly Val Leu Asp Asp Leu 
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1380 



1385 



1390 



cag gac aat gag gtt ctt act gag aat gag aag gag ctg gtg gag cag 4224 
Gin Asp Asn Glu Val Leu Thr Glu Asn Glu Lys Glu Leu Val Glu Gin 
1395 1400 1405 

gaa aag aca egg cag age aag aat gag gec ttg ctg age atg gtg gag 4272 
Glu Lys Thr Arg Gin Ser Lys Asn Glu Ala Leu Leu Ser Met Val Glu 
1410 1415 1420 

aag aaa ggg gac ctg gec ctg gac gtg etc ttc aga age att agt gaa 4320 
Lys Lys Gly Asp Leu Ala Leu Asp Val Leu Phe Arg Ser He Ser Glu 
1425 1430 1435 1440 

agg gac cct tac etc gtg tec tat ctt aga cag cag aat ttg taa 4365 
Arg Asp Pro Tyr Leu Val Ser Tyr Leu Arg Gin Gin Asn Leu 

1445 1450 1455 

aatgagtcag ttaggtagtc tggaagagag aatccagcgt tctcattgga aatggataaa 4425 

cagaaatgtg atcattgatt tcagtgttca agacagaaga agactgggta acatctatca 4485 

cacaggcttt caggacagac ttgtaacctg gcatgtacct attgactgta tcctcatgca 4545 

ttttcctcaa g 4556 



<210> 10 
<211> 1454 
<212> PRT 

<213> Artificial Sequence 
<400> 10 

Met Ala Gly Gly Ala Trp Gly Arg Leu Ala Cys Tyr Leu Glu Phe Leu 
15 10 15 

Lys Lys Glu Glu Leu Lys Glu Phe Gin Leu Leu Leu Ala Asn Lys Ala 
20 25 30 

His Ser Arg Ser Ser Ser Gly Glu Thr Pro Ala Gin Pro Glu Lys Thr 
35 40 45 

Ser Gly Met Glu Val Ala Ser Tyr Leu Val Ala Gin Tyr Gly Glu Gin 
50 55 60 

Arg Ala Trp Asp Leu Ala Leu His Thr Trp Glu Gin Met Gly Leu Arg 
65 70 75 80 
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Ser Leu Cys Ala Gin Ala Gin Glu Gly Ala Gly His Ser Pro Ser Phe 
85 90 95 



Pro Tyr Ser Pro Ser Glu Pro His Leu Gly Ser Pro Ser Gin Pro Thr 
100 105 110 

Ser Thr Ala Val Leu Met Pro Trp lie His Glu Leu Pro Ala Gly Cys 
115 120 125 

Thr Gin Gly Ser Glu Arg Arg Val Leu Arg Gin Leu Pro Asp Thr Ser 
130 135 140 

Gly Arg Arg Trp Arg Glu lie Ser Ala Ser Leu Leu Tyr Gin Ala Leu 
145 150 155 160 

Pro Ser Ser Pro Asp His Glu Ser Pro Ser Gin Glu Ser Pro Asn Ala 
165 170 175 

Pro Thr Ser Thr Ala Val Leu Gly Ser Trp Gly Ser Pro Pro Gin Pro 
180 185 190 

Ser Leu Ala Pro Arg Glu Gin Glu Ala Pro Gly Thr Gin Trp Pro Leu 
195 200 205 

Asp Glu Thr Ser Gly lie Tyr Tyr Thr Glu lie Arg Glu Arg Glu Arg 
210 215 220 

Glu Lys Ser Glu Lys Gly Arg Pro Pro Trp Ala Ala Val Val Gly Thr 
225 230 235 240 

Pro Pro Gin Ala His Thr Ser Leu Gin Pro His His His Pro Trp Glu 
245 250 255 

Pro Ser Val Arg Glu Ser Leu Cys Ser Thr Trp Pro Trp Lys Asn Glu 
260 265 270 

Asp Phe Asn Gin Lys Phe Thr Gin Leu Leu Leu Leu Gin Arg Pro His 
275 280 285 

Pro Arg Ser Gin Asp Pro Leu Val Lys Arg Ser Trp Pro Asp Tyr Val 
290 295 300 

Glu Glu Asn Arg Gly His Leu He Glu He Arg Asp Leu Phe Gly Pro 
305 310 315 320 

Gly Leu Asp Thr Gin Glu Pro Arg He Val He Leu Gin Gly Ala Ala 
325 330 335 
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Gly lie Gly Lys Ser Thr Leu Ala Arg Gin Val Lys Glu Ala Trp Gly 
340 345 350 



Arg Gly Gin Leu Tyr Gly Asp Arg Phe Gin His Val Phe Tyr Phe Ser 
355 360 365 

Cys Arg Glu Leu Ala Gin Ser Lys Val Val Ser Leu Ala Glu Leu lie 
370 375 380 

Gly Lys Asp Gly Thr Ala Thr Pro Ala Pro lie Arg Gin lie Leu Ser 
385 390 395 400 

Arg Pro Glu Arg Leu Leu Phe lie Leu Asp Gly Val Asp Glu Pro Gly 
405 410 415 

Trp Val Leu Gin Glu Pro Ser Ser Glu Leu Cys Leu His Trp Ser Gin 
420 425 430 

Pro Gin Pro Ala Asp Ala Leu Leu Gly Ser Leu Leu Gly Lys Thr lie 
435 440 445 

Leu Pro Glu Ala Ser Phe Leu lie Thr Ala Arg Thr Thr Ala Leu Gin 
450 455 460 

Asn Leu lie Pro Ser Leu Glu Gin Ala Arg Trp Val Glu Val Leu Gly 
465 470 475 480 

Phe Ser Glu Ser Ser Arg Lys Glu Tyr Phe Tyr Arg Tyr Phe Thr Asp 
485 490 495 

Glu Arg Gin Ala lie Arg Ala Phe Arg Leu Val Lys Ser Asn Lys Glu 
500 505 510 

Leu Trp Ala Leu Cys Leu Val Pro Trp Val Ser Trp Leu Ala Cys Thr 
515 520 525 

Cys Leu Met Gin Gin Met Lys Arg Lys Glu Lys Leu Thr Leu Thr Ser 
530 535 540 

Lys Thr Thr Thr Thr Leu Cys Leu His Tyr Leu Ala Gin Ala Leu Gin 
545 550 555 560 

Ala Gin Pro Leu Gly Pro Gin Leu Arg Asp Leu Cys Ser Leu Ala Ala 
565 570 575 

Glu Gly lie Trp Gin Lys Lys Thr Leu Phe Ser Pro Asp Asp Leu Arg 
580 585 590 
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Lys His Gly Leu Asp Gly Ala He He Ser Thr Phe Leu Lys Met Gly 
595 600 605 



He Leu Gin Glu His Pro He Pro Leu Ser Tyr Ser Phe He His Leu 
610 615 620 

Cys Phe Gin Glu Phe Phe Ala Ala Met Ser Tyr Val Leu Glu Asp Glu 
625 630 635 640 

Lys Gly Arg Gly Lys His Ser Asn Cys He He Asp Leu Glu Lys Thr 
645 650 655 

Leu Glu Ala Tyr Gly He His Gly Leu Phe Gly Ala Ser Thr Thr Arg 
660 665 670 

Phe Leu Leu Gly Leu Leu Ser Asp Glu Gly Glu Arg Glu Met Glu Asn 
675 680 685 

He Phe His Cys Arg Leu Ser Gin Gly Arg Asn Leu Met Gin Trp Val 
690 695 700 

Pro Ser Leu Gin Leu Leu Leu Gin Pro His Ser Leu Glu Ser Leu His 
705 710 715 720 

Cys Leu Tyr Glu Thr Arg Asn Lys Thr Phe Leu Thr Gin Val Met Ala 
725 730 735 

His Phe Glu Glu Met Gly Met Cys Val Glu Thr Asp Met Glu Leu Leu 
740 745 750 

Val Cys Thr Phe Cys He Lys Phe Ser Arg His Val Lys Lys Leu Gin 
755 760 765 

Leu He Glu Gly Arg Gin His Arg Ser Thr Trp Ser Pro Thr Met Val 
770 775 780 

Val Leu Phe Arg Trp Val Pro Val Thr Asp Ala Tyr Trp Gin He Leu 
785 790 795 800 

Phe Ser Val Leu Lys Val Thr Arg Asn Leu Lys Glu Leu Asp Leu Ser 
805 810 815 

Gly Asn Ser Leu Ser His Ser Ala Val Lys Ser Leu Cys Lys Thr Leu 
820 825 830 



Arg Arg Pro Arg Cys Leu Leu Glu Thr Leu Arg Leu Ala Gly Cys Gly 
835 840 845 
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Leu Thr Ala Glu Asp Cys Lys Asp Leu Ala Phe Gly Leu Arg Ala Asn 
850 855 860 



Gin Thr Leu Thr Glu Leu Asp Leu Ser Phe Asn Val Leu Thr Asp Ala 
865 870 875 880 

Gly Ala Lys His Leu Cys Gin Arg Leu Arg Gin Pro Ser Cys Lys Leu 
885 890 895 

Gin Arg Leu Gin Leu Val Ser Cys Gly Leu Thr Ser Asp Cys Cys Gin 
900 905 910 



Asp Leu Ala Ser Val Leu Ser Ala Ser Pro Ser Leu Lys Glu Leu Asp 
915 920 925 

Leu Gin Gin Asn Asn Leu Asp Asp Val Gly Val Arg Leu Leu Cys Glu 
930 935 940 

Gly Leu Arg His Pro Ala Cys Lys Leu lie Arg Leu Gly Leu Asp Gin 
945 950 955 960 

Thr Thr Leu Ser Asp Glu Met Arg Gin Glu Leu Arg Ala Leu Glu Gin 
965 970 975 

Glu Lys Pro Gin Leu Leu He Phe Ser Arg Arg Lys Pro Ser Val Met 
980 985 990 

Thr Pro Thr Glu Gly Leu Asp Thr Gly Glu Met Ser Asn Ser Thr Ser 
995 1000 1005 

Ser Leu Lys Arg Gin Arg Leu Gly Ser Glu Arg Ala Ala Ser His Val 
1010 1015 1020 

Ala Gin Ala Asn Leu Lys Leu Leu Asp Val Ser Lys He Phe Pro He 
025 1030 1035 1040 

Ala Glu He Ala Glu Glu Ser Ser Pro Glu Val Val Pro Val Glu Leu 
1045 1050 1055 



Leu Cys Val Pro Ser Pro Ala Ser Gin Gly Asp Leu His Thr Lys Pro 
1060 1065 1070 

Leu Gly Thr Asp Asp Asp Phe Leu Gly Pro Glu Gly Asn Val Asp Val 
1075 1080 1085 

Glu Leu He Asp Lys Ser Thr Asn Arg Tyr Ser Val Trp Phe Pro Thr 
1090 1095 1100 
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Ala Gly Trp Tyr Leu Trp Ser Ala Thr Gly Leu Gly Phe Leu Val Arg 
105 1110 1H5 H20 



Asp Glu Val Thr Val Thr He Ala Phe Gly Ser Trp Ser Gin His Leu 
1125 H30 1135 

Ala Leu Asp Leu Gin His His Glu Gin Trp Leu Val Gly Gly Pro Leu 
1140 1145 H50 

Phe Asp Val Thr Ala Glu Pro Glu Glu Ala Val Ala Glu He His Leu 
1155 H60 1165 

Pro His Phe He Ser Leu Gin Gly Glu Val Asp Val Ser Trp Phe Leu 
1170 H75 1180 

Val Ala His Phe Lys Asn Glu Gly Met Val Leu Glu His Pro Ala Arg 
185 1190 H95 1200 

Val Glu Pro Phe Tyr Ala Val Leu Glu Ser Pro Ser Phe Ser Leu Met 
1205 1210 1215 

Gly He Leu Leu Arg He Ala Ser Gly Thr Arg Leu Ser He Pro He 
1220 1225 1230 

Thr Ser Asn Thr Leu He Tyr Tyr His Pro His Pro Glu Asp He Lys 
1235 1240 1245 

Phe His Leu Tyr Leu Val Pro Ser Asp Ala Leu Leu Thr Lys Ala He 
1250 1255 1260 

Asp Asp Glu Glu Asp Arg Phe His Gly Val Arg Leu Gin Thr Ser Pro 
265 1270 1275 1280 

Pro Met Glu Pro Leu Asn Phe Gly Ser Ser Tyr He Val Ser Asn Ser 
1285 1290 1295 

Ala Asn Leu Lys Val Met Pro Lys Glu Leu Lys Leu Ser Tyr Arg Ser 
1300 1305 1310 

Pro Gly Glu He Gin His Phe Ser Lys Phe Tyr Ala Gly Gin Met Lys 
1315 1320 1325 

Glu Pro He Gin Leu Glu He Thr Glu Lys Arg His Gly Thr Leu Val 
1330 1335 1340 

Trp Asp Thr Glu Val Lys Pro Val Asp Leu Gin Leu Val Ala Ala Ser 
345 1350 1355 1360 



59 



Ala Pro Pro Pro Phe Ser Gly Ala Ala Phe Val Lys Glu Asn His Arg 
1365 1370 1375 



Gin Leu Gin Ala Arg Met Gly Asp Leu Lys Gly Val Leu Asp Asp Leu 
1380 1385 1390 

Gin Asp Asn Glu Val Leu Thr Glu Asn Glu Lys Glu Leu Val Glu Gin 
1395 1400 1405 

Glu Lys Thr Arg Gin Ser Lys Asn Glu Ala Leu Leu Ser Met Val Glu 
1410 1415 1420 

Lys Lys Gly Asp Leu Ala Leu Asp Val Leu Phe Arg Ser lie Ser Glu 
425 1430 1435 1440 

Arg Asp Pro Tyr Leu Val Ser Tyr Leu Arg Gin Gin Asn Leu 
1445 1450 



<210> 11 
<211> 4466 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 
<222> (1) . . (4275) 

<220> 

<223> Description of Artificial Sequence: Synthetic 
Construct 

<400> 11 

atg get ggc gga gec tgg ggc cgc ctg gec tgt tac ttg gag ttc ctg 
Met Ala Gly Gly Ala Trp Gly Arg Leu Ala Cys Tyr Leu Glu Phe Leu 
15 10 15 

aag aag gag gag ctg aag gag ttc cag ctt ctg etc gec aat aaa gcg 
Lys Lys Glu Glu Leu Lys Glu Phe Gin Leu Leu Leu Ala Asn Lys Ala 
20 25 30 

cac tec agg age tct teg ggt gag aca ccc get cag cca gag aag acg 
His Ser Arg Ser Ser Ser Gly Glu Thr Pro Ala Gin Pro Glu Lys Thr 
35 40 45 

agt ggc atg gag gtg gec teg tac ctg gtg get cag tat ggg gag cag 192 
Ser Gly Met Glu Val Ala Ser Tyr Leu Val Ala Gin Tyr Gly Glu Gin 
50 55 60 



48 



96 



144 



60 



egg gec tgg gac 
Arg Ala Trp Asp 
65 

tea ctg tgc gec 
Ser Leu Cys Ala 



ccc tac age cca 
Pro Tyr Ser Pro 
100 

tec acc gca gtg 
Ser Thr Ala Val 
115 

acc cag ggc tea 
Thr Gin Gly Ser 
130 

gga cgc cgc tgg 
Gly Arg Arg Trp 
145 

cca age tec cca 
Pro Ser Ser Pro 



ccc aca tec aca 
Pro Thr Ser Thr 
180 

age eta gca ccc 
Ser Leu Ala Pro 
195 

gat gaa acg tea 
Asp Glu Thr Ser 
210 

gag aaa tea gag 
Glu Lys Ser Glu 
225 

ccc cca cag gcg 
Pro Pro Gin Ala 



eta gec etc 
Leu Ala Leu 
70 

caa gec cag 
Gin Ala Gin 
85 

agt gaa ccc 
Ser Glu Pro 



eta atg ccc 
Leu Met Pro 



gag aga agg 
Glu Arg Arg 
135 

aga gaa ate 
Arg Glu lie 
150 

gac cat gag 
Asp His Glu 
165 

gca gtg ctg 
Ala Val Leu 



aga gag cag 
Arg Glu Gin 



gga att tac 
Gly lie Tyr 
215 

aaa ggc agg 
Lys Gly Arg 
230 

cac acc age 
His Thr Ser 
245 



cat acc tgg 
His Thr Trp 



gaa ggg gca 
Glu Gly Ala 
90 

cac ctg ggg 
His Leu Gly 
105 

tgg ate cat 
Trp lie His 
120 

gtt ttg aga 
Val Leu Arg 



tct gec tea 
Ser Ala Ser 



tct cca age 
Ser Pro Ser 
170 

ggg age tgg 
Gly Ser Trp 
185 

gag get cct 
Glu Ala Pro 
200 

tac aca gaa 
Tyr Thr Glu 

ccc cca tgg 
Pro Pro Trp 



eta cag ccc 
Leu Gin Pro 
250 



gag cag atg ggg 
Glu Gin Met Gly 
75 

ggc cac tct ccc 
Gly His Ser Pro 



tct ccc age caa 
Ser Pro Ser Gin 
110 

gaa ttg ccg gcg 
Glu Leu Pro Ala 
125 

cag ctg cct gac 
Gin Leu Pro Asp 
140 

etc etc tac caa 
Leu Leu Tyr Gin 
155 

cag gag tea ccc 
Gin Glu Ser Pro 



gga tec cca cct 
Gly Ser Pro Pro 
190 

ggg acc caa tgg 
Gly Thr Gin Trp 
205 

ate aga gaa aga 
He Arg Glu Arg 
220 

gca gcg gtg gta 
Ala Ala Val Val 
235 

cac cac cac cca 
His His His Pro 



ctg agg 240 
Leu Arg 
80 

tea ttc 288 
Ser Phe 
95 

ccc acc 336 
Pro Thr 



ggg tgc 384 
Gly Cys 

aca tct 432 
Thr Ser 



get ctt 480 
Ala Leu 
160 

aac gec 528 

Asn Ala 

175 

cag ccc 576 
Gin Pro 

cct ctg 624 
Pro Leu 

gag aga 672 
Glu Arg 



gga acg 720 
Gly Thr 
240 

tgg gag 768 

Trp Glu 

255 



61 



cct tct gtg aga gag age etc tgt tec aca tgg ccc tgg aaa aat gag 816 
Pro Ser Val Arg Glu Ser Leu Cys Ser Thr Trp Pro Trp Lys Asn Glu 
260 265 270 

gat ttt aac caa aaa ttc aca cag ctg eta ctt eta caa aga cct cac 
Asp Phe Asn Gin Lys Phe Thr Gin Leu Leu Leu Leu Gin Arg Pro His 
275 280 285 

ccc aga age caa gat ccc ctg gtc aag aga age tgg cct gat tat gtg 
Pro Arg Ser Gin Asp Pro Leu Val Lys Arg Ser Trp Pro Asp Tyr Val 
290 295 300 

gag gag aat cga gga cat tta att gag ate aga gac tta ttt ggc cca 
Glu Glu Asn Arg Gly His Leu lie Glu lie Arg Asp Leu Phe Gly Pro 
305 310 315 320 

ggc ctg gat acc caa gaa cct cgc ata gtc ata ctg cag ggg get get 
Gly Leu Asp Thr Gin Glu Pro Arg lie Val lie Leu Gin Gly Ala Ala 
325 330 335 

gga att ggg aag tea aca ctg gec agg cag gtg aag gaa gee tgg ggg 
Gly lie Gly Lys Ser Thr Leu Ala Arg Gin Val Lys Glu Ala Trp Gly 
340 345 350 

aga ggc cag ctg tat ggg gac cgc ttc cag cat gtc ttc tac ttc age 
Arg Gly Gin Leu Tyr Gly Asp Arg Phe Gin His Val Phe Tyr Phe Ser 
355 360 365 

tgc aga gag ctg gee cag tec aag gtg gtg agt etc get gag etc ate 1152 
Cys Arg Glu Leu Ala Gin Ser Lys Val Val Ser Leu Ala Glu Leu He 
370 375 380 



gga aaa gat ggg aca gee act ccg get ccc att aga cag ate ctg tct 
Gly Lys Asp Gly Thr Ala Thr Pro Ala Pro He Arg Gin He Leu Ser 
385 390 395 400 

agg cca gag egg ctg etc ttc ate etc gat ggt gta gat gag cca gga 
Arg Pro Glu Arg Leu Leu Phe He Leu Asp Gly Val Asp Glu Pro Gly 
405 410 415 

tgg gtc ttg cag gag ccg agt tct gag etc tgt ctg cac tgg age cag 
Trp Val Leu Gin Glu Pro Ser Ser Glu Leu Cys Leu His Trp Ser Gin 
420 425 430 

cca cag ccg gcg gat gca ctg ctg ggc agt ttg ctg ggg aaa act ata 
Pro Gin Pro Ala Asp Ala Leu Leu Gly Ser Leu Leu Gly Lys Thr He 
435 440 445 



864 



912 



960 



lOOi 



1056 



1104 



1200 



1248 



1296 



1344 



62 



aag cat ggg tta gat ggg gcc ate ate tec ace ttc ttg aag atg ggt 
Lys His Gly Leu Asp Gly Ala lie lie Ser Thr Phe Leu Lys Met Gly 
595 600 605 

att ctt caa gag cac ccc ate cct ctg age tac age ttc att cac etc 
lie Leu Gin Glu His Pro He Pro Leu Ser Tyr Ser Phe He His Leu 
610 615 620 

tgt ttc caa gag ttc ttt gca gca atg tec tat gtc ttg gag gat gag 
Cys Phe Gin Glu Phe Phe Ala Ala Met Ser Tyr Val Leu Glu Asp Glu 
625 630 635 640 



1440 



1488 



ctt ccc gag gca tec ttc ctg ate acg get egg acc aca get ctg cag 1392 
Leu Pro Glu Ala Ser Phe Leu He Thr Ala Arg Thr Thr Ala Leu Gin 
450 455 460 

aac etc att cct tct ttg gag cag gca cgt tgg gta gag gtc ctg ggg 
Asn Leu He Pro Ser Leu Glu Gin Ala Arg Trp Val Glu Val Leu Gly 
465 470 475 480 

ttc tct gag tec age agg aag gaa tat ttc tac aga tat ttc aca gat 
Phe Ser Glu Ser Ser Arg Lys Glu Tyr Phe Tyr Arg Tyr Phe Thr Asp 
485 490 495 

gaa agg caa gca att aga gcc ttt agg ttg gtc aaa tea aac aaa gag 1536 
Glu Arg Gin Ala He Arg Ala Phe Arg Leu Val Lys Ser Asn Lys Glu 
500 505 510 

etc tgg gcc ctg tgt ctt gtg ccc tgg gtg tec tgg ctg gcc tgc act 1584 
Leu Trp Ala Leu Cys Leu Val Pro Trp Val Ser Trp Leu Ala Cys Thr 
515 520 525 

tgc ctg atg cag cag atg aag egg aag gaa aaa etc aca ctg act tec 1632 
Cys Leu Met Gin Gin Met Lys Arg Lys Glu Lys Leu Thr Leu Thr Ser 
530 535 540 

aag acc acc aca acc etc tgt eta cat tac ctt gcc cag get etc caa 1680 
Lys Thr Thr Thr Thr Leu Cys Leu His Tyr Leu Ala Gin Ala Leu Gin 
545 550 555 560 

get cag cca ttg gga ccc cag etc aga gac etc tgc tct ctg get get 1728 
Ala Gin Pro Leu Gly Pro Gin Leu Arg Asp Leu Cys Ser Leu Ala Ala 
565 570 575 

gag ggc ate tgg caa aaa aag acc ctt ttc agt cca gat gac etc agg 1776 
Glu Gly He Trp Gin Lys Lys Thr Leu Phe Ser Pro Asp Asp Leu Arg 
580 585 590 



1824 



1872 



1920 
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aag ggg aga ggt aaa cat tct aat tgc ate ata gat ttg gaa aag acg 1968 
Lys Gly Arg Gly Lys His Ser Asn Cys He He Asp Leu Glu Lys Thr 
645 650 655 

eta gaa gca tat gga ata cat ggc ctg ttt ggg gca tea ace aca cgt 
Leu Glu Ala Tyr Gly He His Gly Leu Phe Gly Ala Ser Thr Thr Arg 
660 665 670 

ttc eta ttg ggc ctg tta agt gat gag ggg gag aga gag atg gag aac 
Phe Leu Leu Gly Leu Leu Ser Asp Glu Gly Glu Arg Glu Met Glu Asn 
675 680 685 

ate ttt cac tgc egg ctg tct cag ggg agg aac ctg atg cag tgg gtc 2112 
He Phe His Cys Arg Leu Ser Gin Gly Arg Asn Leu Met Gin Trp Val 
690 695 700 



ccg tec ctg cag ctg ctg ctg cag cca cac tct ctg gag tec etc cac 
Pro Ser Leu Gin Leu Leu Leu Gin Pro His Ser Leu Glu Ser Leu His 
705 710 715 720 

tgc ttg tac gag act egg aac aaa acg ttc ctg aca caa gtg atg gee 
Cys Leu Tyr Glu Thr Arg Asn Lys Thr Phe Leu Thr Gin Val Met Ala 
725 730 735 

cat ttc gaa gaa atg ggc atg tgt gta gaa aca gac atg gag etc tta 
His Phe Glu Glu Met Gly Met Cys Val Glu Thr Asp Met Glu Leu Leu 
740 745 750 

gtg tgc act ttc tgc att aaa ttc age cgc cac gtg aag aag ctt cag 
Val Cys Thr Phe Cys He Lys Phe Ser Arg His Val Lys Lys Leu Gin 
755 760 765 

ctg att gag ggc agg cag cac aga tea aca tgg age ccc acc atg gta 
Leu He Glu Gly Arg Gin His Arg Ser Thr Trp Ser Pro Thr Met Val 
770 775 780 

gtc ctg ttc agg tgg gtc cca gtc aca gat gee tat tgg cag att etc 
Val Leu Phe Arg Trp Val Pro Val Thr Asp Ala Tyr Trp Gin He Leu 
785 790 795 800 

ttc tec gtc etc aag gtc acc aga aac ctg aag gag ctg gac eta agt 
Phe Ser Val Leu Lys Val Thr Arg Asn Leu Lys Glu Leu Asp Leu Ser 
805 810 815 

gga aac teg ctg age cac tct gca gtg aag agt ctt tgt aag acc ctg 
Gly Asn Ser Leu Ser His Ser Ala Val Lys Ser Leu Cys Lys Thr Leu 
820 825 830 



2016 



2064 



2160 



2208 



2256 



2304 



2352 



2400 



2448 



2496 
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aga cgc cct cgc tgc etc ctg gag acc ctg egg ttg get ggc tgt ggc 
Arg Arg Pro Arg Cys Leu Leu Glu Thr Leu Arg Leu Ala Gly Cys Gly 
835 840 845 

etc aca get gag gac tgc aag gac ctt gee ttt ggg ctg aga gec aac 
Leu Thr Ala Glu Asp Cys Lys Asp Leu Ala Phe Gly Leu Arg Ala Asn 
850 855 860 

cag acc ctg acc gag ctg gac ctg age ttc aat gtg etc acg gat get 
Gin Thr Leu Thr Glu Leu Asp Leu Ser Phe Asn Val Leu Thr Asp Ala 
865 870 875 880 

gga gec aaa cac ctt tgc cag aga ctg aga cag ccg age tgc aag eta 
Gly Ala Lys His Leu Cys Gin Arg Leu Arg Gin Pro Ser Cys Lys Leu 
885 890 895 

cag cga ctg cag ctg gtc age tgt ggc etc acg tct gac tgc tgc cag 
Gin Arg Leu Gin Leu Val Ser Cys Gly Leu Thr Ser Asp Cys Cys Gin 
900 905 910 

gac ctg gee tct gtg ctt agt gee age ccc age ctg aag gag eta gac 
Asp Leu Ala Ser Val Leu Ser Ala Ser Pro Ser Leu Lys Glu Leu Asp 
915 920 925 

ctg cag cag aac aac ctg gat gac gtt ggc gtg cga ctg etc tgt gag 
Leu Gin Gin Asn Asn Leu Asp Asp Val Gly Val Arg Leu Leu Cys Glu 
930 935 940 

ggg etc agg cat cct gee tgc aaa etc ata cgc ctg ggg aaa cca agt 
Gly Leu Arg His Pro Ala Cys Lys Leu lie Arg Leu Gly Lys Pro Ser 
945 950 955 960 

gtg atg acc cct act gag ggc ctg gat acg gga gag atg agt aat age 
Val Met Thr Pro Thr Glu Gly Leu Asp Thr Gly Glu Met Ser Asn Ser 
965 970 975 

aca tec tea etc aag egg cag aga etc gga tea gag agg gcg get tec 
Thr Ser Ser Leu Lys Arg Gin Arg Leu Gly Ser Glu Arg Ala Ala Ser 
980 985 990 

cat gtt get cag get aat etc aaa etc ctg gac gtg age aag ate ttc 
His Val Ala Gin Ala Asn Leu Lys Leu Leu Asp Val Ser Lys lie Phe 
995 1000 1005 

cca att get gag att gca gag gaa age tec cca gag gta gta ccg gtg 
Pro lie Ala Glu lie Ala Glu Glu Ser Ser Pro Glu Val Val Pro Val 
1010 1015 1020 
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2592 



2640 



2688 



2736 



2784 



2832 



2880 



2928 



2976 



3024 



3072 
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gaa etc ttg tgc gtg cct tct cct gec tct caa ggg gac ctg cat acg 3120 
Glu Leu Leu Cys Val Pro Ser Pro Ala Ser Gin Gly Asp Leu His Thr 
1025 1030 1035 1040 

aag cct ttg ggg act gac gat gac ttt ctg ggg cct gaa gga aat gtg 3168 
Lys Pro Leu Gly Thr Asp Asp Asp Phe Leu Gly Pro Glu Gly Asn Val 
1045 1050 1055 

gat gtt gag ttg att gat aag age aca aac aga tac age gtt tgg ttc 3216 
Asp Val Glu Leu lie Asp Lys Ser Thr Asn Arg Tyr Ser Val Trp Phe 
1060 1065 1070 

ccc act get ggc tgg tat ctg tgg tea gec aca ggc etc ggc ttc ctg 3264 
Pro Thr Ala Gly Trp Tyr Leu Trp Ser Ala Thr Gly Leu Gly Phe Leu 
1075 1080 1085 

gta agg gat gag gtc aca gtg acg att gcg ttt ggt tec tgg agt cag 3312 
Val Arg Asp Glu Val Thr Val Thr He Ala Phe Gly Ser Trp Ser Gin 
1090 1095 1100 

cac ctg gec ctg gac ctg cag cac cat gaa cag tgg ctg gtg ggc ggc 3360 
His Leu Ala Leu Asp Leu Gin His His Glu Gin Trp Leu Val Gly Gly 
1105 IHO 1H5 H20 

ccc ttg ttt gat gtc act gca gag cca gag gag get gtc gee gaa ate 3408 
Pro Leu Phe Asp Val Thr Ala Glu Pro Glu Glu Ala Val Ala Glu He 
1125 H30 1135 

cac etc ccc cac ttc ate tec etc caa ggt gag gtg gac gtc tec tgg 3456 
His Leu Pro His Phe He Ser Leu Gin Gly Glu Val Asp Val Ser Trp 
1140 H45 1150 



ttt etc gtt gee cat ttt aag aat gaa ggg atg gtc ctg gag cat cca 3504 
Phe Leu Val Ala His Phe Lys Asn Glu Gly Met Val Leu Glu His Pro 
1155 H60 H65 

gec egg gtg gag cct ttc tat get gtc ctg gaa age ccc age ttc tct 3552 
Ala Arg Val Glu Pro Phe Tyr Ala Val Leu Glu Ser Pro Ser Phe Ser 
1170 H75 H80 

ctg atg ggc ate ctg ctg egg ate gec agt ggg act cgc etc tec ate 3600 
Leu Met Gly He Leu Leu Arg He Ala Ser Gly Thr Arg Leu Ser He 
1185 1190 1195 1200 

ccc ate act tec aac aca ttg ate tat tat cac ccc cac ccc gaa gat 3648 
Pro He Thr Ser Asn Thr Leu He Tyr Tyr His Pro His Pro Glu Asp 
1205 1210 1215 



66 



att aag ttc cac ttg tac ctt gtc ccc age gac gec ttg eta aca aag 3696 
lie Lys Phe His Leu Tyr Leu Val Pro Ser Asp Ala Leu Leu Thr Lys 
1220 1225 1230 

gcg ata gat gat gag gaa gat cgc ttc cat ggt gtg cgc ctg cag act 3744 
Ala He Asp Asp Glu Glu Asp Arg Phe His Gly Val Arg Leu Gin Thr 
1235 1240 1245 

teg ccc cca atg gaa ccc ctg aac ttt ggt tec agt tat att gtg tct 3792 
Ser Pro Pro Met Glu Pro Leu Asn Phe Gly Ser Ser Tyr He Val Ser 
1250 1255 1260 

aat tct get aac ctg aaa gta atg ccc aag gag ttg aaa ttg tec tac 3840 
Asn Ser Ala Asn Leu Lys Val Met Pro Lys Glu Leu Lys Leu Ser Tyr 
1265 1270 1275 1280 



agg age cct gga gaa att cag cac ttc tea aaa ttc tat get ggg cag 
Arg Ser Pro Gly Glu He Gin His Phe Ser Lys Phe Tyr Ala Gly Gin 
1285 1290 ■ 1295 

atg aag gaa ccc att caa ctt gag att act gaa aaa aga cat ggg act 
Met Lys Glu Pro He Gin Leu Glu He Thr Glu Lys Arg His Gly Thr 
1300 1305 1310 

ttg gtg tgg gat act gag gtg aag cca gtg gat etc cag ctt gta get 
Leu Val Trp Asp Thr Glu Val Lys Pro Val Asp Leu Gin Leu Val Ala 
1315 1320 1325 

gca tea gec cct cct cct ttc tea ggt gca gec ttt gtg aag gag aac 
Ala Ser Ala Pro Pro Pro Phe Ser Gly Ala Ala Phe Val Lys Glu Asn 
1330 1335 1340 

cac egg caa etc caa gec agg atg ggg gac ctg aaa ggg gtg etc gat 
His Arg Gin Leu Gin Ala Arg Met Gly Asp Leu Lys Gly Val Leu Asp 
1345 1350 1355 1360 

gat etc cag gac aat gag gtt ctt act gag aat gag aag gag ctg gtg 
Asp Leu Gin Asp Asn Glu Val Leu Thr Glu Asn Glu Lys Glu Leu Val 
1365 1370 1375 

gag cag gaa aag aca egg cag age aag aat gag gec ttg ctg age atg 
Glu Gin Glu Lys Thr Arg Gin Ser Lys Asn Glu Ala Leu Leu Ser Met 
1380 1385 1390 

gtg gag aag aaa ggg gac ctg gee ctg gac gtg etc ttc aga age att 
Val Glu Lys Lys Gly Asp Leu Ala Leu Asp Val Leu Phe Arg Ser He 
1395 1400 1405 



3888 



3936 



3984 



4032 



4080 



4128 



4176 



4224 
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agt gaa agg gac cct tac etc gtg tec tat ctt aga cag cag aat ttg 4272 
Ser Glu Arg Asp Pro Tyr Leu Val Ser Tyr Leu Arg Gin Gin Asn Leu 
1410 1415 1420 



taa aatgagtcag ttaggtagtc tggaagagag aatccagcgt tctcattgga 4325 
1425 

aatggataaa cagaaatgtg atcattgatt tcagtgttca agacagaaga agactgggta 4385 

acatctatca cacaggcttt caggacagac ttgtaacctg gcatgtacct attgactgta 4445 

tcctcatgca ttttcctcaa g 44 66 



<210> 12 
<211> 1424 
<212> PRT 

<213> Artificial Sequence 
<400> 12 

Met Ala Gly Gly Ala Trp Gly Arg Leu Ala Cys Tyr Leu Glu Phe Leu 
15 10 15 

Lys Lys Glu Glu Leu Lys Glu Phe Gin Leu Leu Leu Ala Asn Lys Ala 
20 25 30 

His Ser Arg Ser Ser Ser Gly Glu Thr Pro Ala Gin Pro Glu Lys Thr 
35 40 45 

Ser Gly Met Glu Val Ala Ser Tyr Leu Val Ala Gin Tyr Gly Glu Gin 
50 55 60 

Arg Ala Trp Asp Leu Ala Leu His Thr Trp Glu Gin Met Gly Leu Arg 
65 70 75 80 

Ser Leu Cys Ala Gin Ala Gin Glu Gly Ala Gly His Ser Pro Ser Phe 
85 90 95 

Pro Tyr Ser Pro Ser Glu Pro His Leu Gly Ser Pro Ser Gin Pro Thr 
100 105 110 

Ser Thr Ala Val Leu Met Pro Trp lie His Glu Leu Pro Ala Gly Cys 
115 120 125 

Thr Gin Gly Ser Glu Arg Arg Val Leu Arg Gin Leu Pro Asp Thr Ser 
130 135 140 
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Gly Arg Arg Trp Arg Glu lie Ser Ala Ser Leu Leu Tyr Gin Ala Leu 
145 150 155 160 



Pro Ser Ser Pro Asp His Glu Ser Pro Ser Gin Glu Ser Pro Asn Ala 
165 170 175 

Pro Thr Ser Thr Ala Val Leu Gly Ser Trp Gly Ser Pro Pro Gin Pro 
180 • 185 190 

Ser Leu Ala Pro Arg Glu Gin Glu Ala Pro Gly Thr Gin Trp Pro Leu 
195 200 205 

Asp Glu Thr Ser Gly lie Tyr Tyr Thr Glu lie Arg Glu Arg Glu Arg 
210 215 220 

Glu Lys Ser Glu Lys Gly Arg Pro Pro Trp Ala Ala Val Val Gly Thr 
225 230 235 240 

Pro Pro Gin Ala His Thr Ser Leu Gin Pro His His His Pro Trp Glu 
245 250 255 

Pro Ser Val Arg Glu Ser Leu Cys Ser Thr Trp Pro Trp Lys Asn Glu 
260 265 270 

Asp Phe Asn Gin Lys Phe Thr Gin Leu Leu Leu Leu Gin Arg Pro His 
275 280 285 

Pro Arg Ser Gin Asp Pro Leu Val Lys Arg Ser Trp Pro Asp Tyr Val 
290 295 300 

Glu Glu Asn Arg Gly His Leu lie Glu lie Arg Asp Leu Phe Gly Pro 
305 310 315 320 

Gly Leu Asp Thr Gin Glu Pro Arg lie Val lie Leu Gin Gly Ala Ala 
325 330 335 

Gly lie Gly Lys Ser Thr Leu Ala Arg Gin Val Lys Glu Ala Trp Gly 
340 345 350 

Arg Gly Gin Leu Tyr Gly Asp Arg Phe Gin His Val Phe Tyr Phe Ser 
355 360 365 

Cys Arg Glu Leu Ala Gin Ser Lys Val Val Ser Leu Ala Glu Leu lie 
370 375 380 

Gly Lys Asp Gly Thr Ala Thr Pro Ala Pro lie Arg Gin lie Leu Ser 
385 390 395 400 



69 



Arg Pro Glu Arg Leu Leu Phe lie Leu Asp Gly Val Asp Glu Pro Gly 
405 410 415 



Trp Val Leu Gin Glu Pro Ser Ser Glu Leu Cys Leu His Trp Ser Gin 
420 425 430 

Pro Gin Pro Ala Asp Ala Leu Leu Gly Ser Leu Leu Gly Lys Thr lie 
435 440 445 

Leu Pro Glu Ala Ser Phe Leu lie Thr Ala Arg Thr Thr Ala Leu Gin 
450 455 460 

Asn Leu lie Pro Ser Leu Glu Gin Ala Arg Trp Val Glu Val Leu Gly 
465 470 475 480 

Phe Ser Glu Ser Ser Arg Lys Glu Tyr Phe Tyr Arg Tyr Phe Thr Asp 
485 490 495 

Glu Arg Gin Ala lie Arg Ala Phe Arg Leu Val Lys Ser Asn Lys Glu 
500 505 510 

Leu Trp Ala Leu Cys Leu Val Pro Trp Val Ser Trp Leu Ala Cys Thr 
515 520 525 

Cys Leu Met Gin Gin Met Lys Arg Lys Glu Lys Leu Thr Leu Thr Ser 
530 535 540 

Lys Thr Thr Thr Thr Leu Cys Leu His Tyr Leu Ala Gin Ala Leu Gin 
545 550 555 560 

Ala Gin Pro Leu Gly Pro Gin Leu Arg Asp Leu Cys Ser Leu Ala Ala 
565 570 575 

Glu Gly lie Trp Gin Lys Lys Thr Leu Phe Ser Pro Asp Asp Leu Arg 
580 585 590 

Lys His Gly Leu Asp Gly Ala lie He Ser Thr Phe Leu Lys Met Gly 
595 600 605 

He Leu Gin Glu His Pro He Pro Leu Ser Tyr Ser Phe He His Leu 
610 615 620 

Cys Phe Gin Glu Phe Phe Ala Ala Met Ser Tyr Val Leu Glu Asp Glu 
625 630 635 640 



Lys Gly Arg Gly Lys His Ser Asn Cys He He Asp Leu Glu Lys Thr 
645 650 655 
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Leu Glu Ala Tyr Gly lie His Gly Leu Phe Gly Ala Ser Thr Thr Arg 
660 665 670 



Phe Leu Leu Gly Leu Leu Ser Asp Glu Gly Glu Arg Glu Met Glu Asn 
675 680 685 

He Phe His Cys Arg Leu Ser Gin Gly Arg Asn Leu Met Gin Trp Val 
690 695 700 

Pro Ser Leu Gin Leu Leu Leu Gin Pro His Ser Leu Glu Ser Leu His 
705 710 715 720 

Cys Leu Tyr Glu Thr Arg Asn Lys Thr Phe Leu Thr Gin Val Met Ala 
725 730 735 

His Phe Glu Glu Met Gly Met Cys Val Glu Thr Asp Met Glu Leu Leu 
740 745 750 

Val Cys Thr Phe Cys He Lys Phe Ser Arg His Val Lys Lys Leu Gin 
755 760 765 

Leu He Glu Gly Arg Gin His Arg Ser Thr Trp Ser Pro Thr Met Val 
770 775 780 

Val Leu Phe Arg Trp Val Pro Val Thr Asp Ala Tyr Trp Gin He Leu 
785 790 795 800 

Phe Ser Val Leu Lys Val Thr Arg Asn Leu Lys Glu Leu Asp Leu Ser 
805 810 815 

Gly Asn Ser Leu Ser His Ser Ala Val Lys Ser Leu Cys Lys Thr Leu 
820 825 830 

Arg Arg Pro Arg Cys Leu Leu Glu Thr Leu Arg Leu Ala Gly Cys Gly 
835 840 845 

Leu Thr Ala Glu Asp Cys Lys Asp Leu Ala Phe Gly Leu Arg Ala Asn 
850 855 860 

Gin Thr Leu Thr Glu Leu Asp Leu Ser Phe Asn Val Leu Thr Asp Ala 
865 870 875 880 

Gly Ala Lys His Leu Cys Gin Arg Leu Arg Gin Pro Ser Cys Lys Leu 
885 890 895 

Gin Arg Leu Gin Leu Val Ser Cys Gly Leu Thr Ser Asp Cys Cys Gin 
900 905 910 
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Asp Leu Ala Ser Val Leu Ser Ala Ser Pro Ser Leu Lys Glu Leu Asp 
915 920 925 



Leu Gin Gin Asn Asn Leu Asp Asp Val Gly Val Arg Leu Leu Cys Glu 
930 935 940 

Gly Leu Arg His Pro Ala Cys Lys Leu He Arg Leu Gly Lys Pro Ser 
945 950 955 960 

Val Met Thr Pro Thr Glu Gly Leu Asp Thr Gly Glu Met Ser Asn Ser 
965 970 975 

Thr Ser Ser Leu Lys Arg Gin Arg Leu Gly Ser Glu Arg Ala Ala Ser 
980 985 990 

His Val Ala Gin Ala Asn Leu Lys Leu Leu Asp Val Ser Lys He Phe 
995 1000 1005 

Pro He Ala Glu He Ala Glu Glu Ser Ser Pro Glu Val Val Pro Val 
1010 1015 1020 

Glu Leu Leu Cys Val Pro Ser Pro Ala Ser Gin Gly Asp Leu His Thr 
025 1030 1035 1040 

Lys Pro Leu Gly Thr Asp Asp Asp Phe Leu Gly Pro Glu Gly Asn Val 
1045 1050 1055 

Asp Val Glu Leu He Asp Lys Ser Thr Asn Arg Tyr Ser Val Trp Phe 
1060 1065 1070 

Pro Thr Ala Gly Trp Tyr Leu Trp Ser Ala Thr Gly Leu Gly Phe Leu 
1075 1080 1085 

Val Arg Asp Glu Val Thr Val Thr He Ala Phe Gly Ser Trp Ser Gin 
1090 1095 HOO 

His Leu Ala Leu Asp Leu Gin His His Glu Gin Trp Leu Val Gly Gly 
105 IHO 1H5 1120 

Pro Leu Phe Asp Val Thr Ala Glu Pro Glu Glu Ala Val Ala Glu He 
1125 1130 H35 

His Leu Pro His Phe He Ser Leu Gin Gly Glu Val Asp Val Ser Trp 
1140 1145 H50 



Phe Leu Val Ala His Phe Lys Asn Glu Gly Met Val Leu Glu His Pro 
1155 1160 H65 
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Ala Arg Val Glu Pro Phe Tyr Ala Val Leu Glu Ser Pro Ser Phe Ser 
1170 1175 1180 



Leu Met Gly lie Leu Leu Arg lie Ala Ser Gly Thr Arg Leu Ser lie 
185 1190 1195 1200 

Pro lie Thr Ser Asn Thr Leu lie Tyr Tyr His Pro His Pro Glu Asp 
1205 1210 1215 

He Lys Phe His Leu Tyr Leu Val Pro Ser Asp Ala Leu Leu Thr Lys 
1220 1225 1230 

Ala He Asp Asp Glu Glu Asp Arg Phe His Gly Val Arg Leu Gin Thr 
1235 1240 1245 

Ser Pro Pro Met Glu Pro Leu Asn Phe Gly Ser Ser Tyr He Val Ser 
1250 1255 1260 

Asn Ser Ala Asn Leu Lys Val Met Pro Lys Glu Leu Lys Leu Ser Tyr 
265 1270 1275 1280 

Arg Ser Pro Gly Glu He Gin His Phe Ser Lys Phe Tyr Ala Gly Gin 
1285 1290 1295 

Met Lys Glu Pro He Gin Leu Glu He Thr Glu Lys Arg His Gly Thr 
1300 1305 1310 

Leu Val Trp Asp Thr Glu Val Lys Pro Val Asp Leu Gin Leu Val Ala 
1315 1320 1325 

Ala Ser Ala Pro Pro Pro Phe Ser Gly Ala Ala Phe Val Lys Glu Asn 
1330 1335 1340 

His Arg Gin Leu Gin Ala Arg Met Gly Asp Leu Lys Gly Val Leu Asp 
345 1350 1355 1360 

Asp Leu Gin Asp Asn Glu Val Leu Thr Glu Asn Glu Lys Glu Leu Val 
1365 1370 1375 

Glu Gin Glu Lys Thr Arg Gin Ser Lys Asn Glu Ala Leu Leu Ser Met 
1380 1385 1390 

Val Glu Lys Lys Gly Asp Leu Ala Leu Asp Val Leu Phe Arg Ser He 
1395 1400 1405 



Ser Glu Arg Asp Pro Tyr Leu Val Ser Tyr Leu Arg Gin Gin Asn Leu 
1410 1415 1420 
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<210> 13 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence 
<400> 13 

ccgaattcac catggctggc ggagcctggg gc 

<210> 14 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence 
<400> 14 

ccgctcgagt caacagaggg ttgtggtggt cttg 

<210> 15 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence 
<400> 15 

cccgaattcg aacctcgcat agtcatactg c 

<210> 16 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence 
<400> 16 

gtcccacaac agaattcaat ctcaacggtc 
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<210> 17 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 17 

tgtgatgaga gaagcggtga c 



<210> 18 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 18 

ccgctcgagc aaagaagggt cagccaaagc 30 
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